THE HOROCYCLE FLOW AT PRIME TIMES 
Abstract. We prove that the orbit of a generic point at prime
values of the horocycle flow in the modular surface is dense in a set
of positive measure. For some special orbits we also prove that they
are dense in the whole space, assuming the Ramanujan/Selberg
conjectures for $GL_2/\mathbb{Q}$. In the process, we derive an effective
version of Dani's Theorem for the (discrete) horocycle flow.



1. Introduction 

If $(X, T)$ is a dynamical system, for any $x \in X$ one can ask about the
distribution of points $P_x = \{T^p x : p \text{ prime}\}$ in the orbit closure
$O_x = \overline{\{T^n x : n \ge 1\}}$. For example if $X$ is finite then this is
equivalent to Dirichlet's Theorem on primes in an arithmetic progression.
If $(X, T)$ is ergodic, Bourgain [5] shows that for almost all $x$, $T^p x$ with
$p$ prime, satisfies the Birkhoff ergodic Theorem and hence is equidistributed.
If $(X, T)$ is 'chaotic', for example if it has a positive entropy then there
may be many $x$'s for which $T^p x$ is poorly distributed in $O_x$. For example if
$T : [0, 1] \to [0, 1]$ is the doubling map $x \mapsto 2x$ then one can construct
an explicit (in terms of its binary expansion) $\xi$ such that $O_\xi = [0, 1]$ but
$T^p \xi \to 0$ as $p \to \infty$.

The setting in which one can hope for a regular behaviour on restricting
to primes is that of unipotent orbits in a homogeneous space.
Let $G$ be a connected Lie group, $\Gamma$ a lattice in $G$ and $u \in G$ an
$\mathrm{Ad}_G$ unipotent element, then Ratner's Theorem [31] says that if
$X = \Gamma\backslash G$ and $T : X \to X$ is given by

(1.1)  $T(\Gamma g) = \Gamma g u,$

then $O_x$, with $x = \Gamma g$, is homogeneous and the orbit $xu^n$, $n = 1, 2, \ldots$ is
equidistributed in $O_x$ w.r.t. an algebraic measure $d\mu_x$. In the case that
$\mu_x$ is the normalized volume measure $d\mu_G$ on $X$ it is conjectured in [12]
that $\overline{P_x} = X$ and in fact that $xu^p$, $p = 2, 3, 5, 7, 11, \ldots$ is equidistributed
w.r.t. $d\mu_G$. Care should be taken in formulating this conjecture in the
intermediate cases where $O_x$ is not connected as there may be local
congruence obstructions, but with the 'obvious' modifications this
conjecture seems quite plausible.

In intermediate cases where $O_x$ is one of



(i) finite 

(ii) a connected circle or more generally a torus 

(iii) a connected nilmanifold $\Gamma\backslash N$,

$P_x$ and the behaviour of $xu^p$, $p = 2, 3, 5, \ldots$ is understood. Case (i)
requires no further comment while for (ii) it follows from Vinogradov's
work that the points are equidistributed w.r.t. $dt$, the volume measure
on the torus. The same is true for (iii) as was shown recently by Green
and Tao [13, 14]; in order to prove this, apart from using Vinogradov's
methods they had to control sums of the type $\sum_n e(\alpha n[\beta n])$, which are
similar to Weyl sums but behave in a more complex way.
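
Case (ii) can be explored numerically. The following is a minimal sketch, not taken from the paper: it assumes the circle $\mathbb{R}/\mathbb{Z}$ with the rotation $Tx = x + \alpha$, starting point $x = 0$, and measures how much mass the prime-time orbit gives to an arc. The function names `primes_below` and `prime_orbit_mass` are hypothetical helpers introduced only for this illustration.

```python
import math

def primes_below(n):
    """Sieve of Eratosthenes: all primes p < n (requires n >= 2)."""
    sieve = bytearray([1]) * n
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(n - 1) + 1):
        if sieve[i]:
            # strike out the multiples of i starting from i*i
            sieve[i * i :: i] = bytearray(len(range(i * i, n, i)))
    return [i for i in range(n) if sieve[i]]

def prime_orbit_mass(alpha, n, lo, hi):
    """Fraction of primes p < n whose orbit point p*alpha mod 1 lies in [lo, hi).

    This is the mass given to the arc [lo, hi) by the empirical measure
    over prime times, for the rotation x -> x + alpha started at x = 0
    (an assumed toy model of case (ii))."""
    primes = primes_below(n)
    hits = sum(1 for p in primes if lo <= (alpha * p) % 1.0 < hi)
    return hits / len(primes)

if __name__ == "__main__":
    # For irrational alpha the mass of an arc should approach its length.
    print(prime_orbit_mass(math.sqrt(2), 20000, 0.0, 0.5))
```

For an irrational rotation number the printed fraction should be close to the length of the arc, which is what Vinogradov's theorem predicts in the limit; the sketch only gives finite-range evidence.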

Our purpose in this paper is to examine this problem in the basic 
case of X = SL(2, Z)\SX(2, R). According to Hedlund pi], T x is either 
finite, a closed horocycle of length $l$, $0 < l < \infty$, or is all $X$. The first
two cases correspond to (i) and (ii). In the last case we say that $x$
is generic. By a theorem of Dani [8] the orbit $xu^n$, $n = 1, 2, 3, \ldots$ is
equidistributed in its closure w.r.t. one of the corresponding three types
of algebraic measures. For $N \ge 1$ and $x \in X$ define the probability
measure $\pi_{x,N}$ on $X$ by



(1.2)  $\pi_{x,N} := \frac{1}{\pi(N)} \sum_{p < N} \delta_{xu^p},$



where for $\xi \in X$, $\delta_\xi$ is the delta mass at $\xi$ and $\pi(N)$ is the number of
primes less than $N$. We are interested in the weak limits $\nu_x$ of the $\pi_{x,N}$
as $N \to \infty$ (in the sense of integrating against continuous functions on
the one-point compactification of $X$). If $x$ is generic then the conjecture
is equivalent to saying that any such $\nu_x$ is $d\mu_G$. One can also allow $x$,
the initial point of the orbit, to vary with $N$ in this analysis and in the
measures in (1.2). Of special interest is the case

$x = \Gamma g = H_N := \Gamma \begin{pmatrix} N^{-1/2} & 0 \\ 0 & N^{1/2} \end{pmatrix}$ and $u = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix},$

when $H_N u^j$, $0 \le j \le N - 1$, is a periodic orbit for $T$ of period $N$.
These points are spread evenly on the unique closed horocycle in $X$
whose length is $N$. They also comprise a large piece of the Hecke
points in $X$ corresponding to the Hecke correspondence of degree $N$,

$C_N = \left\{ \frac{1}{\sqrt{N}} \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} : ad = N,\ a, d > 0,\ b \bmod d \right\}.$
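
As a small illustration (a sketch under the assumption that the Hecke points of degree $N$ are indexed by triples $(a, b, d)$ with $ad = N$, $a, d > 0$ and $b$ taken modulo $d$, as the condition in the displayed set indicates), one can enumerate these triples and check that their number is $\sum_{d | N} d$, and that exactly $N$ of them have $a = 1$, $d = N$, which are the ones coming from the periodic horocycle orbit. The helper names are hypothetical.

```python
def hecke_points(n):
    """All triples (a, b, d) with a * d = n, a, d > 0 and 0 <= b < d."""
    return [(n // d, b, d)
            for d in range(1, n + 1) if n % d == 0
            for b in range(d)]

def sigma1(n):
    """Sum of the positive divisors of n."""
    return sum(d for d in range(1, n + 1) if n % d == 0)

if __name__ == "__main__":
    n = 12
    points = hecke_points(n)
    on_horocycle = [p for p in points if p[0] == 1]   # a = 1, d = n
    print(len(points), sigma1(n), len(on_horocycle))
```

When $N$ is prime the horocycle points account for $N$ of the $N + 1$ triples, which matches the remark that they form a large piece of the Hecke points.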
We can now state our main results. The first asserts that $\nu_x$ does not
charge small sets with too much mass, that is $\nu_x$ is uniformly absolutely
continuous with respect to $d\mu_G$.

Theorem 1.1 (Non-concentration at primes). Let $x$ be generic and $\nu_x$
a weak limit of $\pi_{x,N}$, then

$d\nu_x \le 10\, d\mu_G.$

Remark: As we said before, probably what truly happens is that
$d\nu_x = d\mu_G$. If on the other hand we allow $n$ to vary not over primes
but over almost primes, then the quantitative equidistribution that
we develop to prove Theorem 1.1 can be used together with a lower
bound sieve (see [11], Chapter 12) to prove the density of the orbit.
More precisely let $x$ be generic, then the points $T^n x$, as $n$ varies over
numbers with at most 10 prime factors, are dense in $X$.

As a consequence of Theorem 1.1 we deduce that $P_x$ has to be big.

Corollary 1.2 (Large closure for primes). Let $x$ be generic, then

$\mathrm{Vol}(\overline{P_x}) \ge \frac{1}{10}$

and if $U \subset X$ is an open set with $\mathrm{Vol}(U) > 1 - 1/10$ then $xu^p \in U$, for
a positive density of primes $p$.

In the case of 'Hecke orbits' $P_{H_N}$, we can prove more;

Theorem 1.3 (Prime Hecke orbits are dense). Let $\nu$ be a weak limit
of the measures $\pi_{H_N,N}$. Assuming the Ramanujan/Selberg Conjectures
concerning the automorphic spectrum of $GL_2/\mathbb{Q}$ (see for example the
Appendix in [35]), we have

$\frac{1}{5}\, d\mu_G \le d\nu \le \frac{9}{5}\, d\mu_G.$

Theorem 1.3 has an application to a variant of Linnik's problem on
projections of integral points on the level 1 surface for quadratic forms
in 4 variables. Let $N \ge 1$ and denote by $M_N$ the set of $2 \times 2$ matrices
whose determinant equals $N$. Denote by $\pi$ the projection $A \mapsto \frac{1}{\sqrt{N}} A$ of
$M_N(\mathbb{R})$ onto $M_1(\mathbb{R})$. Using their ergodic methods Linnik and Skubenko
[27] show that the projection of the integer points $M_N(\mathbb{Z})$ into $M_1(\mathbb{R})$
become dense as $N \to \infty$. In quantitative form they show that for $U$
a (nice) compact subset of $M_1(\mathbb{R})$

(1.3)  $|\{A \in M_N(\mathbb{Z}) : \pi(A) \in U\}| \sim \sigma_1(N)\mu(U)$

as $N \to \infty$, where $\mu$ is the 'Hardy-Littlewood' normalized Haar
measure for $SL_2(\mathbb{R})$ on $M_1(\mathbb{R})$ and $\sigma_1(N) = \sum_{d | N} d$. (1.3) can also be
proved using Kloosterman's techniques in the circle method [23] as well
as using Hecke Correspondences in $GL_2$ as explained in [33]. Using the
last connection we establish the following Corollary. For

$A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in M_N(\mathbb{Z})$



let $a_1 = (a, N)$ and $d_1 = N/a_1$. Let $b_1 = b_1(A)$ be the unique integer
$0 \le b_1 < d_1$ satisfying $b_1 \equiv \bar{a} b \bmod d_1$ where $a\bar{a} \equiv 1 \bmod d_1$.

Corollary 1.4. Assume the Ramanujan/Selberg Conjectures for $GL_2$.
Let $U$ be a (nice) compact subset of $M_1(\mathbb{R})$ and $\epsilon > 0$. Then for $N$
sufficiently large

$\frac{1/5 - \epsilon}{\log N} \le \frac{|\{A \in M_N(\mathbb{Z}) : \pi(A) \in U,\ b_1(A) \text{ is prime}\}|}{|\{A \in M_N(\mathbb{Z}) : \pi(A) \in U\}|} \le \frac{9/5 + \epsilon}{\log N}.$

In particular the projections of the points $A \in M_N(\mathbb{Z})$ with $b_1(A)$ prime,
become dense in $M_1(\mathbb{R})$.



We end the introduction with an outline of the contents of the sections
and of the techniques that we use. The proofs of Theorems 1.1 and
1.3 make use of sieve methods. These reduce sums over primes $\sum_{p < N} f(xu^p)$,
to the study of linear sums over progressions $\sum_{n < N/d} f(xu^{nd})$, and bilinear
sums $\sum_{n < \min(N/d_1, N/d_2)} f_1(xu^{nd_1}) f_2(xu^{nd_2})$. A critical point in
the analysis is to allow $d$ to be as large as possible; this is measured
by the level of distribution $\alpha$: $d \le N^\alpha$ (respectively $\max(d_1, d_2) \le N^\alpha$).
The first type of sums are connected with equidistribution in $(X, T)$
and the second type with joinings of $(X, T^{d_1})$ with $(X, T^{d_2})$.
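
To see in the simplest possible setting how a sum over integers free of small prime factors turns into sums over progressions, here is a sketch of the classical Legendre inclusion-exclusion identity with the constant function $f \equiv 1$. It is only an illustration of the mechanism: the paper uses upper and lower bound sieves, not this exact identity, and `legendre_count` is a hypothetical helper name.

```python
from itertools import combinations
from math import prod

def legendre_count(n, sieving_primes):
    """Number of integers 1 <= m <= n coprime to every sieving prime.

    Computed as the sum over squarefree d dividing P (the product of the
    sieving primes) of mu(d) * floor(n / d); each floor(n / d) is a sum
    over the progression of multiples of d."""
    total = 0
    for k in range(len(sieving_primes) + 1):
        for subset in combinations(sieving_primes, k):
            d = prod(subset)               # squarefree divisor of P
            total += (-1) ** k * (n // d)  # mu(d) = (-1)^k
    return total

if __name__ == "__main__":
    # Integers up to 100 with no prime factor <= 7: the number 1 and the
    # primes in (10, 100].
    print(legendre_count(100, [2, 3, 5, 7]))
```

The number of terms grows exponentially with the number of sieving primes, which is one reason why truncated sieve weights and a good level of distribution are needed in practice.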

The effective rate of equidistribution of long pieces of unipotent orbits
has been studied in the case of general compact quotients $\Gamma\backslash SL(2, \mathbb{R})$.
For continuous such orbits this is due to Burger [7] while for discrete
ones to Venkatesh [39]. Both make use of the spectral gap in the
decomposition of $SL(2, \mathbb{R})$ acting by translations on $L^2(\Gamma\backslash SL(2, \mathbb{R}))$.
One can quantify Venkatesh's method to obtain a positive level of
distribution for the linear sums and then follow the analysis in the proof
of Theorem 1.1 to obtain the analogues of Theorem 1.1 and Corollary
1.2 for such $\Gamma$'s (the constant 10 is replaced by a number depending
on the spectral gap).

For $\Gamma\backslash SL(2, \mathbb{R})$ noncompact but of finite volume, due to the existence
of periodic orbits of the horocycle flow one cannot formulate a simple
uniform rate of equidistribution in Dani's Theorem. In a preprint [38]
A. Strömbergsson gives an effective version of Dani's Theorem for
continuous orbits in terms of the excursion rate of geodesics; he
uses Burger's approach.
We only learned of [38] after completing our formulation and treatment
of an effective Dani theorem for the continuous flows, see Theorems
4.5 and 4.6 in section 4. One of the main results of this paper
is an effective Dani Theorem for discrete unipotent orbits of $(X, T^s)$,
see Theorems 4.11 and 5.2. These give a quantitative equidistribution
w.r.t. algebraic measures of long pieces of such orbits and allowing $s$
to be large. This discrete case is quite a bit more complex both in its
formulation and its proof. It requires a series of basic Lemmas (see
section 2) which use the action of $SL_2(\mathbb{Z})$ to give quantitative
approximations of pieces of horocycle orbits by periodic horocycles, much like
the approximation of reals by rationals in the theory of diophantine
approximation. Critical to this analysis are various parameters associated
with a given piece of horocycle orbit. The resulting approximations
allow us to approach the level of distribution sums by taking $f, f_1, f_2$
to be automorphic forms and expanding them in Fourier series in the
cusp. The burden of the analysis is in this way thrown onto the Fourier
coefficients of these forms. This leads us to Theorem 4.11 which gives
a suitable level of distribution in the linear sums and with which we
can apply an upper bound sieve (Brun, Selberg) and deduce Theorem
1.1 and Corollary 1.2.

Theorem 1.3 involves a lower bound sieve and in particular a level of
distribution for bilinear sums. This is naturally connected with effective
equidistribution of 1-parameter unipotent orbits in
$(SL(2, \mathbb{Z})\backslash SL(2, \mathbb{R})) \times (SL(2, \mathbb{Z})\backslash SL(2, \mathbb{R}))$ which is a well known open
problem since Ratner's paper [30]. For the special Hecke points that are
taken in Theorem 1.3 our Fourier expansion approach converts the bilinear
sums into sums of products of shifted coefficients of these automorphic
forms ("Shifted convolution"). In Section 3 we review the spectral approach to this well
studied problem; in particular we use the recent treatments in [31 0] 
which are both convenient for our application and also allow for a
critical improvement over [35] in the level aspect. Proposition 3.1 gives
a slight improvement over [1] and also [29], in this $q$ aspect, and it is
optimal under the Ramanujan/Selberg Conjectures. Concerning the
linear sums for these Hecke orbits we establish a level of 1/2 (see the
discussion at the end of section 7). This is the optimal level that can
be proved by automorphic form/spectral methods. To analyze the sum
over primes we use the sieve developed by Duke-Friedlander-Iwaniec
[10]. For an asymptotics for the sum over primes (i.e. a "prime number
theorem") they require a level of distribution of 1/3 for the bilinear
sums, given the level of 1/2 that we have for the linear sums. Using
the best bounds towards Ramanujan/Selberg Conjecture for $GL_2/\mathbb{Q}$
we establish a level of $\alpha = 3/19$ for these bilinear sums. This falls
short of the 1/3 mark as well as the 1/5 mark which is needed to get
a lower bound in the sum over primes. However assuming the
Ramanujan/Selberg Conjecture this does give a strong enough level of
distribution to deduce Theorem 1.3.



2. Horocycle Approximation 



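The group $G = SL(2, \mathbb{R})$ can be parametrized through its Lie algebra.
The Lie algebra $\mathfrak{g}$ consists of the $2 \times 2$ real matrices of zero trace. In
this way, the so-called Iwasawa parametrization $g = h(x)a(y)k(\theta)$ with
$x, y, \theta \in \mathbb{R}$ and

$h(x) = \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix}, \quad a(y) = \begin{pmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{pmatrix}, \quad k(\theta) = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}$

corresponds to the Lie algebra basis

(2.1)  $R = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad H = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad V = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$

in the sense that $h(x) = \exp(xR)$, $a(e^{2u}) = \exp(uH)$ and $k(\theta) = \exp(\theta V)$.
Moreover it is unique when restricting $\theta$ to $[-\pi, \pi)$.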
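
The Iwasawa coordinates can be read off from the entries of a matrix directly. The sketch below is an illustration rather than anything from the paper: it assumes the conventions $h(x) = \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix}$, $a(y) = \mathrm{diag}(y^{1/2}, y^{-1/2})$ and $k(\theta)$ the rotation with first row $(\cos\theta, \sin\theta)$, and uses the fact that $g \cdot i = x + iy$ because $k(\theta)$ fixes $i$. All function names are hypothetical.

```python
import math

def h(x):
    return ((1.0, x), (0.0, 1.0))

def a(y):
    return ((math.sqrt(y), 0.0), (0.0, 1.0 / math.sqrt(y)))

def k(t):
    return ((math.cos(t), math.sin(t)), (-math.sin(t), math.cos(t)))

def mul(p, q):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(p[i][l] * q[l][j] for l in range(2)) for j in range(2))
        for i in range(2)
    )

def iwasawa(g):
    """Return (x, y, theta) with g = h(x) a(y) k(theta), assuming det g = 1.

    The bottom row of g is (-y^{-1/2} sin(theta), y^{-1/2} cos(theta)),
    which determines y and theta; x is the real part of g applied to i.
    atan2 returns a value in [-pi, pi]; the boundary case is not
    normalised here."""
    (ga, gb), (gc, gd) = g
    n2 = gc * gc + gd * gd
    x = (ga * gc + gb * gd) / n2
    y = 1.0 / n2
    theta = math.atan2(-gc, gd)
    return x, y, theta

if __name__ == "__main__":
    g = mul(mul(h(0.3), a(2.0)), k(1.1))
    print(iwasawa(g))
```

Recovering the inputs from the product is a quick consistency check on the sign conventions chosen for $k(\theta)$.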
We can explicitly define a left $G$-invariant metric on $G$ as

$d_G(g, h) = \inf\left\{ \sum_{i=0}^{n-1} \psi(x_i, x_{i+1}) : x_0, \ldots, x_n \in G;\ x_0 = g;\ x_n = h \right\}$

with $\psi(x, y) = \min(\|x^{-1}y - I\|, \|y^{-1}x - I\|)$ and $\|\cdot\|$ any norm. Let us
fix the norm

$\left\| \begin{pmatrix} a & b \\ c & d \end{pmatrix} \right\| = \sqrt{2a^2 + (b + c)^2 + 4c^2 + 2d^2}$

for concreteness. Then, we can describe the metric in terms of the
Iwasawa parametrization as

$ds_G^2 = \frac{dx^2 + dy^2}{y^2} + d\theta^2.$

We can also write any Haar measure in $G$ as a multiple of

$d\mu_G = \frac{dx\, dy}{y^2} \frac{d\theta}{\pi}.$

Since $G$ is unimodular, this measure is both left and right $G$-invariant.

By sending $(x + iy, \theta)$ to $h(x)a(y)k(\theta/2)$ we see that $\{\pm I\}\backslash G$ endowed
with this metric is isometric to the unit tangent bundle $T_1\mathbb{H}$ of the
Poincaré upper half-plane $\mathbb{H}$ with the metric $ds^2 = y^{-2}(dx^2 + dy^2)$.
We shall use both notations to refer to an element of $\{\pm I\}\backslash G$, and we
shall even use $x + iy$ as a shorthand for $(x + iy, 0)$. In this way, we can
express multiplication in $\{\pm I\}\backslash G$ as

(2.2)  $\begin{pmatrix} a & b \\ c & d \end{pmatrix} (z, \theta) = \left( \frac{az + b}{cz + d},\ \theta - 2\arg(cz + d) \right).$

Figure 1. The horocycle flow


Now, we define the discrete horocycle flow at distance $s$ as the
transformation $g \mapsto gh(s)$. The name comes from the fact that a horocycle
is a circle in $\mathbb{H}$ tangent to $\partial\mathbb{H}$, and the horocycle flow sends a point
with tangent vector pointing towards the center of the horocycle $S$ to
the unique point at distance $s$ "to the right" whose tangent vector also
points to the center of $S$. In terms of our parametrization, we can write

(2.3)  $gh(t + \cot\theta) = h\left(\alpha - \frac{Rt}{t^2 + 1}\right) a\left(\frac{R}{t^2 + 1}\right) k(-\operatorname{arccot} t)$

where

$R = y(\sin\theta)^{-2}$

is the diameter of the horocycle and

$\alpha = x - yW$

its point of tangency with $\partial\mathbb{H}$, where $W = \cot\theta$ (see Figure 1).
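
Formula (2.3) can be sanity-checked numerically. The sketch below assumes the reading $x' = \alpha - Rt/(t^2 + 1)$, $y' = R/(t^2 + 1)$ of (2.3), with $R = y(\sin\theta)^{-2}$, $\alpha = x - y\cot\theta$ and $t = s - \cot\theta$, and verifies two things: that $s = 0$ returns the starting point, and that every point of the orbit lies on the Euclidean circle of diameter $R$ tangent to the real axis at $\alpha$. The function names are hypothetical.

```python
import math

def horocycle_params(x, y, theta):
    """Diameter R, tangency point alpha and W = cot(theta) of the horocycle
    through g = h(x) a(y) k(theta); requires sin(theta) != 0."""
    w = math.cos(theta) / math.sin(theta)
    r = y / math.sin(theta) ** 2
    alpha = x - y * w
    return r, alpha, w

def flow(x, y, theta, s):
    """Base point (x', y') of g h(s), using (2.3) with t = s - W."""
    r, alpha, w = horocycle_params(x, y, theta)
    t = s - w
    return alpha - r * t / (t * t + 1.0), r / (t * t + 1.0)

if __name__ == "__main__":
    x, y, theta = 0.2, 1.5, 0.7
    r, alpha, _ = horocycle_params(x, y, theta)
    for s in (0.0, 0.5, 3.0, 40.0):
        xs, ys = flow(x, y, theta, s)
        # distance from the centre (alpha, R/2) should equal R/2
        print(s, math.hypot(xs - alpha, ys - r / 2.0), r / 2.0)
```

The circle check uses only the identity $t^2/(t^2+1)^2 + \left(1/(t^2+1) - 1/2\right)^2 = 1/4$, so it tests the shape of (2.3) independently of any matrix computation.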

Now, we consider the homogeneous space $X = \Gamma\backslash G$, with the metric
induced from the one in $G$. Then, we have that $X$ is isometric to
$T_1(\Gamma\backslash\mathbb{H})$. So, considering the identification $g = (z, \theta)$, we can set

$D_X = \{(z, \theta) : |z| > 1,\ -1/2 < \Re z < 1/2,\ -\pi < \theta < \pi\}$

as a fundamental domain for $X$.

We also have that the horocycle flow in $G$ descends to $X$, and when
doing so its behaviour becomes more complex. As we noted in the
introduction, Dani proved that for any $\xi \in X$ the orbit generated by
the horocycle flow, $\{\xi h(s)^n : n = 0, 1, 2, \ldots\}$, is dense in either

(i) a discrete periodic subset of a closed horocycle 

(ii) a closed horocycle 

(iii) the whole space, 

and we can explicitly state which possibility happens in terms of $\xi = \Gamma g$:
(i) for $\alpha$ and $sR^{-1}$ rational numbers, (ii) for $\alpha$ rational and $sR^{-1}$
irrational and (iii) for $\alpha$ irrational. Moreover, in each case the orbit
becomes equidistributed in its closure w.r.t. the algebraic probability
measure supported there.

If one considers the continuous version of the horocycle flow,
$\{\xi h(t) : t \in \mathbb{R}_{\ge 0}\}$, only the possibilities (ii) and (iii) can occur,
depending just on the rationality of $\alpha$.

Our purpose in sections 4 and 5 is to analyze which of the possibilities
is the "nearest" for a finite orbit

$\{\xi h(s)^n : 0 \le sn \le T\};$

or $\{\xi h(t) : t \in [0, T]\}$ in the continuous case, in terms of the parameters
$\alpha$, $sR^{-1}$ and $T$. In preparation for that, we define some quantities
associated to the piece of horocycle $\{\xi h(t) : t \in [0, T]\}$, which will be
useful for applying spectral theory, and moreover allow us to decide
between (i), (ii) and (iii) for $\xi$. Let $Y_T(g)$ be the "Euclidean distance"
from the piece of horocycle $P_{g,T} = \{gh(t) : t \in [0, T]\}$ to the border $\partial\mathbb{H}$,
namely

$Y_T(g) = \inf\{y : h(x)a(y)k(\theta) \in P_{g,T}\}$

which coincides with the $y$ associated to one of the extremes of the
piece. Due to (2.3) we have

(2.4)  $Y_T(h(x)a(y)k(\theta)) \asymp \min\left(y, \frac{R}{T^2}\right),$

where the symbol $\asymp$ is defined as follows.

Definition 2.1 (Notation for bounds). Let $g > 0$. We shall use the
standard notation $f = O(g)$ or $f \ll g$ meaning $|f| \le Cg$ for some
constant $C > 0$; we shall also write $f \asymp g$ as a substitute for $f \ll g \ll f$.
Finally, we shall use the notation $f < g^{O(1)}$ and $f < g^{-O(1)}$ meaning
$|f| < g^C$ and $|f| < (g^{-1})^C$ respectively, for some constant $C > 0$. The
implicit constant $C$ will not depend on any other variable unless in a
statement that contains an implication of the kind "if $f_1 = O(g_1)$ then
$f_2 = O(g_2)$"; in that case the constant implicit in $O(g_2)$ depends on
the one in $O(g_1)$.
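
The two-sided bound in (2.4) can be made concrete. In the sketch below (an illustration with hypothetical function names, assuming the reading $Y_T \asymp \min(y, R/T^2)$ of (2.4)), the height along the orbit is $R/(u^2 + 1)$ with $u = t - W$, which is unimodal in $t$, so the infimum over $t \in [0, T]$ is attained at an endpoint. An elementary computation then shows that the ratio of the two quantities stays between $1/4$ and $4$.

```python
import math

def lowest_height(y, theta, big_t):
    """Y_T(g): the smallest height on the piece {g h(t) : 0 <= t <= T}.

    The height at time t is R / ((t - W)^2 + 1), so the minimum over the
    interval is the smaller of the two endpoint values."""
    w = math.cos(theta) / math.sin(theta)
    r = y / math.sin(theta) ** 2
    return min(r / (w * w + 1.0), r / ((big_t - w) ** 2 + 1.0))

def comparison(y, theta, big_t):
    """The right-hand side min(y, R / T^2) of (2.4)."""
    r = y / math.sin(theta) ** 2
    return min(y, r / big_t ** 2)

if __name__ == "__main__":
    for theta in (0.05, 1.0, 3.0):
        for big_t in (1.0, 10.0, 100.0):
            ratio = lowest_height(1.0, theta, big_t) / comparison(1.0, theta, big_t)
            print(theta, big_t, round(ratio, 4))
```

The constants $1/4$ and $4$ come from $\max(W^2, (T - W)^2) \ge T^2/4$ and $(T - W)^2 \le 2T^2 + 2W^2$; they are not claimed to be the constants used in the paper.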

Now, the key concept is the following. 
Definition 2.2 (Fundamental period). Let $g \in G$ and $T > 1$. We
define the fundamental period of $\xi = \Gamma g$ at distance $T$ as $y_T^{-1}$, where

$y_T = y_T(\Gamma g) = \sup\{Y_T(\gamma g) : \gamma \in \Gamma\}.$

The point of this definition is that we want to approximate our piece
of horocycle by a closed one. Now, a closed horocycle has the shape
$\Gamma a(y)H$ with $H$ the closed subgroup $H = \{h(t) : t \in \mathbb{R}\}$; the period of
this closed horocycle is $y^{-1}$, in the sense that $\Gamma a(y)h(\cdot)$ is a periodic
function of period $y^{-1}$. We shall see the closed horocycle of period $y_T^{-1}$
is near to our original piece of horocycle $\{\Gamma gh(t) : t \in [0, T]\}$.

From the discontinuity of the $\Gamma$ action one can deduce that the supremum
in the definition of $y_T$ is actually reached. Moreover, for $g \in U$ an
open dense subset of $G$ one can assure that this happens for a unique
point $g_T = h(x)a(*)k(*)$ with $-1/2 < x \le 1/2$, of the shape either $\gamma g$
or $\gamma gh(T)$ for some $\gamma \in \Gamma$. This defines the key parameters $\theta_T$, $\alpha_T$, $y_T$
and $W_T$ associated to $g_T$ (see Figure).

Let us define the following equivalence relation in the space of pieces
of horocycles of length $T$: $P \sim P'$ if there exists $\gamma \in \Gamma$ such that
$P' = \gamma P$ as sets. We can identify a piece $P$ with its point $g$ which is
nearest to $\partial\mathbb{H}$; in case that both extremes are at the same distance
from the border, we choose the left one. In this way, we can see the
space of pieces of horocycles of length $T$ as a subset of $PSL_2(\mathbb{R})$. Then,
we have just showed that

$D_{X,T} = \{g_T : g \in U \cap D_X\}$

is a fundamental domain for this equivalence relation. We give an
explicit description of this fundamental domain beginning by realizing
$y_T$ arithmetically; for that purpose we make the following definitions.

Definition 2.3 (Torus distance). Let $\alpha \in \mathbb{R}$. We define the integral
part of $\alpha$, and write $[\alpha]$, as the nearest integer to $\alpha$. We also define its
fractional part as $\{\alpha\} = \alpha - [\alpha]$. Finally, we define $\|\alpha\|$ as the absolute
value of $\{\alpha\}$.

Definition 2.4 (Rational approximation). Let $\alpha \in \mathbb{R}$ and $U \ge 1$. We
define

$\kappa_U(\alpha) = \min\{m \in \mathbb{N} : \|m\alpha\| < U^{-1}\}$
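
Definitions 2.3 and 2.4 translate directly into a few lines of code. The sketch below assumes the strict inequality $\|m\alpha\| < U^{-1}$ and works in floating point, so it is only reliable away from ties; `torus_norm` and `kappa` are hypothetical names. By Dirichlet's approximation theorem the loop stops at some $m \le U$.

```python
import math

def torus_norm(a):
    """||a||: the distance from a to the nearest integer."""
    return abs(a - round(a))

def kappa(alpha, u):
    """kappa_U(alpha): the least m >= 1 with ||m * alpha|| < 1 / U.

    Floating-point sketch: exact ties ||m * alpha|| == 1 / U are not
    handled specially."""
    m = 1
    while torus_norm(m * alpha) >= 1.0 / u:
        m += 1
    return m

if __name__ == "__main__":
    print(kappa(math.sqrt(2), 10))   # multiples of sqrt(2) against 1/10
    print(kappa(math.pi, 50))        # multiples of pi against 1/50
```

For $\alpha = \sqrt{2}$ and $U = 10$ the first multiple within $1/10$ of an integer is $5\sqrt{2} \approx 7.071$, and for $\alpha = \pi$, $U = 50$ it is $7\pi \approx 21.991$.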

From (2.4) we know that $y_T(\Gamma g)$ is always larger than $(2T)^{-2}$. It
is natural that we can improve on that by translating $g$ by different
elements $\gamma \in \Gamma$. The following result reflects how far we can go.

Lemma 2.5 (Period realization). Let $T > 5$. For any $g \in D_X$ we have

$y_T(\Gamma g) \asymp \min(y, RT^{-2}) + T^{-1} \min\left( \frac{U}{\kappa_U(\alpha)}, \frac{U^{-1}}{\|\kappa_U(\alpha)\alpha\|} \right)^2$

with $U = (T/R)^{1/2}$. Therefore, for any $g \in G$ we have

$y_T(\Gamma g) \gg T^{-1}.$

Proof. We can rewrite the $\Gamma$ action (2.2) as

$y_\gamma = \frac{R^{-1}}{c^2 + 2ce\cos\theta + e^2}, \qquad R_\gamma = \frac{R}{(c\alpha + d)^2},$

where $\gamma$ is the matrix there, with $R_\gamma$ and $y_\gamma$ the corresponding
parameters associated to $\gamma g$, and $e = \pm(yR_\gamma)^{-1/2}$, the sign given by the
one of $(c\alpha + d)\sin\theta$. Let us first treat the case $R > T$. We want to
show that $y_T \asymp \min(y, RT^{-2})$. If $y < RT^{-2}$, since $g \in D_X$ it is clear
that $g = g_T$. If $y > RT^{-2}$, let us suppose that $y_T > CRT^{-2}$ for a large
constant $C$. If $g_T$ equals either $\gamma g$ or $\gamma gh(T)$, we deduce from (2.4)
that $R_\gamma \gg CR$, so that writing $g_T = \gamma g$ we have $d = -[c\alpha]$. Therefore

and since $R_\gamma y \gg CRy \gg CTy \gg CT > C$ we get that $y_\gamma \ll c^{-2}R^{-1} \ll RT^{-2}$
which is in contradiction with our assumption.

Now let us treat the case $T > R$. Choosing $c = \kappa_U(\alpha)$ and
$d = -[\kappa_U(\alpha)\alpha]$ in the previous formulas, we have $R_\gamma = R\|\kappa_U(\alpha)\alpha\|^{-2} > T$
and then $y_\gamma \asymp \kappa_U(\alpha)^{-2}R^{-1}$. Considering (2.4) this clearly implies

$Y_T(\gamma g) \asymp \min\left( \frac{1}{\kappa_U(\alpha)^2 R}, \frac{R}{\|\kappa_U(\alpha)\alpha\|^2 T^2} \right),$

which equals the second term in the sum of the Lemma's statement.
Since $y_T(\Gamma g) \ge Y_T(\gamma g)$, it only remains to prove that $y_T(\Gamma g) < CY_T(\gamma g)$
for some constant $C > 1$. Let us suppose this is not the case; then there
exists $\gamma_* \in \Gamma$ such that $Y_T(\gamma_* g) > CY_T(\gamma g)$ and since $Y_T(\gamma g) \gg T^{-1}$
by (2.4) it follows that $\|c_*\alpha\| < U^{-1}$ which by definition of $\kappa_U$ implies
$c_* \ge \kappa_U$. Also we could repeat the previous reasoning to show that

$Y_T(\gamma_* g) \asymp \min\left( \frac{1}{c_*^2 R}, \frac{R}{\|c_*\alpha\|^2 T^2} \right).$

Finally, one can see that either $\kappa_U(\alpha)$ divides $c_*$ or $c_* \asymp U$, but then
$Y_T(\gamma_* g) = O(Y_T(\gamma g))$ which is a contradiction. □

Can we get something better than the bound $O(T)$ for the fundamental
period? Not in general, think for instance of the case $g = a(T^{-1})$:
then the piece of horocycle is just the closed horocycle of length $T$ and
$y_T = T^{-1}$ in this case. This is not a coincidence, as the following result
shows.

Lemma 2.6 (Domain description). For any $T > 2$ and $c > 0$, we
define the set

$B_c(T) = \{h(x)a(y)k(\theta) : -1/2 < x \le 1/2,\ y^{-1} < Tc^{-1},\ |\theta| < T^{-1}c^{-1}\}.$

We have that

$B_{c_1}(T) \subset D_{X,T} \subset B_{c_2}(T)$

for some positive constants $c_1$, $c_2$ and any $T > 2$.

Proof. The inclusion $D_{X,T} \subset B_{c_2}(T)$ comes just from Lemma 2.5 and
the fact that by (2.3) the lowest point in any piece of horocycle of
length $T$ has $\theta = O(T^{-1})$.

Let $g = h(x)a(y)k(\theta) \in B_{c_1}(T)$ for some large $c_1$. Suppose that
$g \notin D_{X,T}$, then there exists $\gamma \in \Gamma$ with $c = c_\gamma \ne 0$ such that $\gamma g \in D_{X,T}$.
So, since $g$ and $g' = h(x')a(y')k(\theta') := gh(T)$ are in $P_{g,T}$, by Lemma
2.5 we have

$\min(y(\gamma g), y(\gamma g')) = y_T(g) \ge \frac{\epsilon}{T}$

for some $\epsilon > 0$. On the other hand, one can check that $|x' - x| \gg yT$
and $y' \asymp y$. Therefore, by (2.2) we have

$\min(y(\gamma g), y(\gamma g')) \ll \frac{y}{\max(|cx + d|, |cx' + d|)^2} \ll \frac{y}{|c|^2 |x - x'|^2} \ll \frac{1}{yT^2}$

which is $O(\frac{1}{c_1 T})$, giving a contradiction for $c_1$ large enough. □

Remark: This lemma implies that in any case, a large part of the
piece of horocycle lies at height $O(y_T)$, thus showing that it is near to
the closed horocycle of period $y_T^{-1}$. Moreover, it says that the
fundamental domain is essentially

$|x| < 1/2, \qquad y^{-1} < T < |W|$

with $y^{-1}$ describing the period of the associated closed horocycle and
$W = \cot\theta$ measuring the distance to it; $\theta = 0$ being the extreme case
in which the piece is actually a closed horocycle.

From (2.3) we can write

$gh(t + \cot\theta) = h(\alpha - Rt^{-1})\, a(Rt^{-2}) + O(t^{-1})$

meaning that both points are at distance $O(t^{-1})$ in $G$. Since $g_T$ can be
either $\gamma g$ or $\gamma gh(T)$, we can always parametrize any piece of horocycle
of length $T$ as

<- + if^)«(a±fe) +0 (rS^) teM - 

As we said before, we shall understand the piece of horocycle in terms
of the parameters $\alpha_T$, $y_T$ and $W_T$ (and $s$ in the discrete case). But we
are mainly interested in a fixed $\Gamma g$ and letting $s$ change, and in that
situation we will be able to express the results just in terms of $\alpha$ and
$sR^{-1}$. To do that, we need to relate both kinds of conditions. First we
write the necessary result for the continuous orbit.

Lemma 2.7 (Fundamental period and continuous Dani). Let $0 < \delta < 1/2$
and $g \in G$ with coefficients bounded by $\delta^{-1}$. Then, $y_T^{-1} < \delta^{-O(1)}$
if and only if there exists an integer $q < \delta^{-O(1)}$ such that
$\|q\alpha\| < \delta^{-O(1)}T^{-1}$.

Remark. For our application to orbits at prime values we will
always have $\delta = (\log T)^{-A}$ for some constant $A > 0$ in this lemma as
well as in later statements. Moreover, throughout the whole paper we
can assume that $(\delta^{-1})^c < T$ for some large constant $c > 1$, because
otherwise the results are trivial.

Proof. Considering the conditions in the statement, the result is just a
consequence of Lemma 2.5. □

Now we write the analogous result for a discrete orbit. Probably it is
better to skip it on first reading, at least until one arrives at Theorem
4.11. Before, let us fix our notation for inverses modulo a number.

Definition 2.8 (Modular inverses). Let $q$ be an integer different from
zero. For any $a \in \mathbb{Z}$ coprime to $q$, we define $\bar{a}$ as the integer between 1
and $q$ such that $a\bar{a} \equiv 1 \bmod q$.
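
Definition 2.8 is straightforward to compute. The following sketch uses Python's built-in three-argument `pow` with exponent $-1$ (available from Python 3.8); the function name `mod_inverse` and the handling of negative $q$ through $|q|$ are assumptions made for the illustration, chosen so that the result lies between 1 and $|q|$.

```python
def mod_inverse(a, q):
    """The integer a_bar with 1 <= a_bar <= |q| and a * a_bar = 1 (mod q).

    Raises ValueError when q is zero or when a is not coprime to q."""
    m = abs(q)
    if m == 0:
        raise ValueError("q must be different from zero")
    inv = pow(a, -1, m)          # built-in modular inverse, Python >= 3.8
    return inv if inv != 0 else m  # the case m == 1: represent 0 by 1

if __name__ == "__main__":
    print(mod_inverse(3, 7))
    print(mod_inverse(10, 7))
```

Both calls return the same residue since $10 \equiv 3 \bmod 7$.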

Lemma 2.9 (Fundamental period and discrete Dani). Let N > 1, 

< 5 < 1/2 and s > <5 _1 A^ _1 . Let g G G with coefficients bounded by 
Then the following statements are equivalent: 
(i) There exists $q \in \mathbb{N}$ and $\gamma \in \Gamma$ with the coefficients of $\gamma$ and $q$
bounded by (5~ 1 r(g 2 ))° ( ' 1 - ) such that g = 7g satisfies 



l^ll < (rVfe)) ^- 1 , 



a|| < (r 1 r(g 2 ))°( 1 )(siV 2 )- 1 . 



where g 2 and g 2 are the denominators in the expressions as a 
reduced fractions of an d j^f? respectively. 

(ii) There exists y < 1 and an integer q' with y~ ^ /{j(q' 2 )5 < 
q' < y-h (t(? 2 )5- 1 )° (1) snc/z too* r# = r(x + iy, 0) 

ll^ll + JVUg'syll + (siV) 2 g'% < ^(i" 1 ^)^, 

where q' 2 is the denominator in the expression as a reduced 
fraction of . 

Remarks. The proof actually gives g 2 = q 2 - One can check that 
conditions in (i) assure that the coefficients of 7 and q are always 
bounded by (1 + s)°^S~°^ . One can see in the proof that from (i) we 
actually get a y in (ii) satisfying y 3> (1 + s)~ 2 ~ o(1 ). 

Proof. Let us begin by demonstrating that (ii) implies (i). Let us write 

explicitly the Iwasawa decomposition 

(2.6) 

— xy 2 sin# + cos 9 xy 2 cos 9 + yi sin 9 
—y~z sin 9 



g = h(x)a(y)k(9) 
or equivalently, with W = cot 9 
(2-7) 5 = 

From (ii) we have that 



y 2 cos 9 



-R-2a R'zaW + Rz 
-R-* R-^W 



x = ^ + yO(M) q[ I q>, (a[, q[) = l,M = (r(q' 2 )5~ 1 )°«. 

On the other hand, conditions in (ii) always imply y^ 1 < 5~°( 1 \s 2 . 
Then considering the matrix 



<k -a'i 



in T, we have 

9* = la'Jq[9sN = 



-a[y^(l + (yq[a' 1 )~ 1 90(M)) * 

q[yHl + 90(M)) q' iy ^O(M\ 
Since $|\theta| < M(sN)^{-2}$, this gives



R- 1 = q[ 2 y(l + 



0{M), 
s 2 N 2 ■ 



a 



a[ 1 0(M) 



+ 



q[ q' 2 y s 2 N 2 



with a = a(g*), R = R(g*) — the corresponding parameters associated 
to g. Also gives yig*)' 1 < Mq' 2 y < M, so if q' 2 y < M" 1 then g* is 
in the fundamental domain, but then since go is bounded this implies 
qfy > M" 1 . Now, 



7d = 9i(? s ?/) 



O(M) 
siV 2 



s M s 
l^ll < ^ , [^1 



so that g = <// q[ < M satisfies 

q'Aq'sy] 

and also, using the expression for a, we have 

-a\[q'sy], 

On the other hand 



r s M 



qq'{ 



Q 



so ^2 = Qi and ^2 = 4i- Finally, we have that the coefficients of the 
matrix g* are bounded by M, so since Tg* = Tg we have g* = jgo, 7 
with coefficients bounded by M and (i) follows. 

Now let us prove that (i) implies (ii). Let s = s(g),R = R(g) and 
W = W{g). We have 



(21 



R 



r s i M 

[q R ] + N ' 



a 



a 1 
Qi 



+ 



M 

Jn 2 



with M = (T(q 2 )5 for some coprime integers a%, qi, with q\ | 

We also have = y(g)~^ < M. Let us consider g* = 'y ai /q 1 g = 

{x + iy,0)k{6). Via (EZJ) 



(2.9) 



qiM 
S 2 N 2 



i?^l(l + ^) 



so considering the components in the lower row we have (by (12.61) ) 



tan 01 < 



M 
s 2 N 21 



Rq( 



1 + 



M . 
s 2 N 2 ' 
so taking $q' = qq_1$ we have



, _ 1 8 ( . M _ [gi] M. _ [g|] , M 



g'sy = 1 + -^=) = ^ 1 + — ) = ^ + 



gi iT s 2 iV 2y g x v siV y q t qi N 

so ||g's?/|| < My^N" 1 . We also have M^ x y^ < q' < My~^ and 
(siV) 2 g'2/|6!| < y^M. Now, by the second column in (12. 9p and by (12. 6 j) 



we have 



i 



x + ytan# = — ^ — — A " 1<?1 - = — — + 



^g x (l + ^) Qi It 
so g'x = — qai + y^M and then ||g'x|| < My^. Finally 

oi = [[giH jgjy] = 
gi [g|] ' g' gg? 

so gi = g 2 and q' 2 = g 2 . □ 



3. Automorphic Forms

A key to our analysis of the averages of functions along pieces of long
periodic horocycles is the use of automorphic forms. We will need the
sharpest known estimates for periods of the type

(3.1)  $\int_0^1 f\left( \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix} g \right) e(-hx)\, dx$

where $h \in \mathbb{Z}$, $g \in G$ and $f$ is a mildly varying function on $L_0^2(\Gamma_0(q)\backslash G)$
and $q \ge 1$ is an integer. Here the subzero indicates that $\int_{\Gamma_0(q)\backslash G} f(g)\, dg = 0$
and $\Gamma_0(q)$ denotes the standard Hecke congruence subgroup of $SL_2(\mathbb{Z})$.

If $h = 0$ and $g = a(y)$ then (3.1) measures the equidistribution of
the closed horocycle of period $1/y$ in $\Gamma_0(q)\backslash G$. This can be studied
using Eisenstein series and the precise rate of equidistribution is tied
up with the Riemann Hypothesis (see [32] for the case $q = 1$, the rate
is $O_\epsilon(y^{3/4 - \epsilon})$ if and only if RH is true). For $h \ne 0$ the size of (3.1) is
controlled by the full spectral theory of $L_0^2(\Gamma_0(q)\backslash G)$ and in particular
the Ramanujan/Selberg conjectures for $GL_2/\mathbb{Q}$ (see the appendix to
[35]). In this case (3.1) is closely related to the much studied shifted
convolution problem. In order to bench-mark the upper bound that we
are aiming for, consider the case that $f \in L_0^2(\Gamma_0(q)\backslash G)$ is $K$ invariant,
that is $f(gk(\theta)) = f(g)$. Thus $f = f(z)$ with $z = x + iy \in \mathbb{H}$ and (3.1)
is the period

(3.2)  $\int_0^1 f(x + iy) e(-hx)\, dx.$
o 
We are assuming that $f$ is smooth and in $L_0^2(\Gamma_0(q)\backslash\mathbb{H})$. We use the
Sobolev norms

(3.3)  $\|f\|_{W^{2d}}^2 = \int_{\Gamma_0(q)\backslash\mathbb{H}} |f(z)|^2\, dA(z) + \int_{\Gamma_0(q)\backslash\mathbb{H}} |\Delta^d f(z)|^2\, dA(z)$

where $d \ge 0$ is an integer, $dA = \frac{dx\, dy}{y^2}$ is the area form and $\Delta$ the
Laplacian for the hyperbolic metric.

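Expanding $f$ in the Laplacian spectrum of $\Gamma_0(q)\backslash\mathbb{H}$ (see [19]) with
$\phi_j$ an orthonormal basis of cusp forms and $E_j(z, s)$ the corresponding
Eisenstein series, $j = 1, 2, \ldots, \nu(q)$, yields

(3.4)  $f(z) = \sum_{j \ne 0} \langle f, \phi_j \rangle \phi_j(z) + \sum_{j=1}^{\nu} \frac{1}{2\pi} \int_0^\infty \langle f, E_j(\cdot, \tfrac{1}{2} + it) \rangle E_j(z, \tfrac{1}{2} + it)\, dt.$

Note that $\phi_0(z) = 1/\sqrt{\mathrm{Vol}(\Gamma_0(q)\backslash\mathbb{H})}$ does not appear since we are
assuming that

(3.5)  $\int_{\Gamma_0(q)\backslash\mathbb{H}} f(z)\, dA(z) = 0.$

Let $\lambda_j$ denote the Laplacian eigenvalue of $\phi_j$ and write $\lambda_j = \frac{1}{4} + t_j^2$. $\phi_j$
may be expanded in a Fourier series (see [19])

(3.6)  $\phi_j(z) = \sum_{n \ne 0} \rho_j(n) y^{1/2} K_{it_j}(2\pi|n|y) e(nx).$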

Using the Atkin-Lehner level raising operators one can choose the
orthonormal basis to consist of new forms and old forms and then
normalizing the coefficients in (3.4) amounts to bounding the residues
of $L(s, \phi_j \times \phi_j)$ at $s = 1$. These are known to be bounded above and
below by (Xjq) ±e respectively (see [UJ [20] ). In this way one has the 
bound (see [22] for details when q is square free which essentially is the 
case of interest to us, and [31 H] for the general q) 

(3.7)  $\rho_j(h) \ll_\epsilon (\lambda_j q h)^{\epsilon}\, h^{\theta} \cosh\left(\frac{\pi t_j}{2}\right) / \sqrt{q}$

for any $\epsilon > 0$, and where $\theta$ is an acceptable exponent for the
Ramanujan/Selberg Conjecture. $\theta = 7/64$ is known to be acceptable [24] while
$\theta = 0$ is what is conjectured to be true.
It follows from ([317]), fl33D and (El} that 

f f( x+i y)e(-hx) dx « e yiMpL \K ltj (2n\h\y)X*\ cosh(^) +cts 

Jo \/<l 7?c 1 
where the term "cts" is a similar contribution from the continuous
spectrum and for which $\theta = 0$ is known since the coefficients of Eisenstein
series are unitary divisor sums.

The Bessel function K satisfies the inequalities, say for v <C 1 (see 

m 

       $K_\nu(v) \ll_\epsilon v^{-\nu - \epsilon}$,  $0 < \nu < 1/2$,

(3.9)  $K_{it}(v) \ll_\epsilon v^{-\epsilon}$,  $0 \le t \le 1$,

       $e^{\pi t/2} K_{it}(v) \ll_\epsilon v^{-\epsilon}$,  $1 \le t < \infty$.

Hence from (13.71) we have that for < \hy\ <C 1 
(3.10) 

t f(x + it)e(-hx) dx « e y~ 2 ~ e ~y h y J2 I (f, ft) I A} 
Jo VI m 

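Weyl's law for $\Gamma_0(q)\backslash\mathbb{H}$ gives the uniform bound

(3.12)   $\sum_{\lambda_j \le \lambda} 1 \ll \mathrm{Vol}(\Gamma_0(q)\backslash\mathbb{H})\, \lambda$

for $\lambda \ge 1$ and $q \ge 1$. Hence

(3.13)   $\sum_{j \neq 0} \lambda_j^{-2+2\epsilon} \ll \mathrm{Vol}(\Gamma_0(q)\backslash\mathbb{H}) = q \prod_{p \mid q} \Big(1 + \frac{1}{p}\Big) \mathrm{Vol}(\Gamma_0(1)\backslash\mathbb{H}) \ll q^{1+\epsilon}.$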
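As a quick sanity check on the volume factor in (3.13), the index $q\prod_{p \mid q}(1 + 1/p)$ can be computed exactly in integer arithmetic. The following is only a minimal sketch; the function names `prime_divisors` and `gamma0_index` are illustrative choices of ours and are not taken from the paper.

```python
def prime_divisors(q):
    """Distinct prime divisors of an integer q >= 1, by trial division."""
    primes, p = [], 2
    while p * p <= q:
        if q % p == 0:
            primes.append(p)
            while q % p == 0:
                q //= p
        p += 1
    if q > 1:
        primes.append(q)
    return primes


def gamma0_index(q):
    """Return q * prod_{p | q} (1 + 1/p) as an exact integer."""
    index = q
    for p in prime_divisors(q):
        # index is still divisible by p here, so the division is exact
        index = index // p * (p + 1)
    return index


# The ratio index / q grows very slowly, which is the content of << q^(1+eps).
for q in (1, 2, 6, 12, 30, 210):
    print(q, gamma0_index(q), gamma0_index(q) / q)
```

For instance the ratio `gamma0_index(q) / q` stays below 3 for every `q` printed above, even though `q` itself ranges up to 210.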

We conclude that for $|h|y \ll 1$ and $\epsilon > 0$

(3.14)   $\int_0^1 f(x+iy)\, e(-hx)\, dx \ll_\epsilon (q y^{-1})^{\epsilon}\, y^{1/2-\theta}\, \|f\|_{W^2}.$

For $\theta = 0$, (3.14) is sharp, that is it cannot be improved. To see this
take for example $h = 1$ in

(3.15)   $\int_0^1 f(x+iy)\, e(-hx)\, dx = y^{1/2} \sum_{j \neq 0} \langle f, \phi_j\rangle\, \rho_j(h)\, K_{it_j}(2\pi|h|y) + \text{cts}.$

Choosing

$f(z) = \sum_{0 < t_j < 1} \overline{\rho_j(1)}\, K_{it_j}(2\pi y)\, \phi_j(z)$

yields

$\|f\|_{W^2}^2 \asymp \sum_{0 < t_j < 1} |\rho_j(1)|^2\, |K_{it_j}(2\pi y)|^2$

while

(3.16)   $\int_0^1 f(x+iy)\, e(-hx)\, dx = y^{1/2} \sum_{0 < t_j < 1} |\rho_j(1)|^2\, |K_{it_j}(2\pi y)|^2 \asymp y^{1/2}\, \|f\|_{W^2}^2.$

Recall that Iwaniec [20] shows that

(3.17)   $|\rho_j(1)| \gg_\epsilon (\lambda_j q)^{-\epsilon} \cosh\!\left(\tfrac{\pi t_j}{2}\right) / \sqrt{q},$

hence changing $y$ a little if need be to make sure that $|K_{it_j}(2\pi y)| \gg 1$
for most $t_j < 1$, we see that for this $f$

(3.18) ||/|| w2 > e (gA,)- 6 . 

It follows from (3.18) and (3.16) that (3.14) is sharp when $\theta = 0$.

On the other hand if $\theta > 0$ then (3.14) can be improved for $q$ in the
range $y^{-1/2+2\theta} < q < y^{-1/2}$. To do so we estimate the $j$-sum in (3.16) using
the Kuznetsov formula for $\Gamma_0(q)\backslash\mathbb{H}$, rather than invoking the sharpest
bound (3.7) for the individual coefficients. One applies Kuznetsov with
suitably chosen positive (on the spectral side) test functions and then,
using only Weil's upper bound for the Kloosterman sums that appear
on the geometric side of the formula, we get (see [20]); for $X > 1$, $h \neq 0$

(3.19) IP^)| 2 * 4N <<1 + — 



0<A J <i 



q 



and for A > 1 

(3.20) |p J W| 2 cosh( 7 rt J )<A(l + ^). 

!<A f <A 

Now for $0 < |hy| \ll 1$, using (3.9) and (3.15) we have
(3.21) 

f f(z + iy)e{hx)dx<yt{ ^ \ Pj {h)\ 2 \hy\-^{ £ |(/ - x |L ' 

J 



,Y 3 /\ J 2 



+^(Ei^( /i )i 2 i^r 2 )nEK/>^)i 2A ?) i 



Taking $X = |hy|^{-1/2}$ in (3.19) and applying it to the first term on the
right hand side of (3.21) and applying (3.20) for the second term we
arrive at
(3.22) 

L i y~i \h\~& 

f(x+iy)e{-hx)dx<^yi\\f\\ W 2{l+^+i-^-)<£y* llJllW 2^ 

since $|h| \ll y^{-1}$.

We combine (3.14) with (3.22) to arrive at our strongest unconditional
estimate; for $0 < |hy| \ll 1$ and $\epsilon > 0$,

(3.23)   $\int_0^1 f(x+iy)\, e(-hx)\, dx \ll_\epsilon (y^{-1} q)^{\epsilon}\, y^{1/2}\, \|f\|_{W^2}\, \min\!\big(y^{-\theta},\, 1 + y^{-1/4} q^{-1/2}\big).$

For the application to the level of equidistribution in type I and II 
sums connected to sieving as we do in section [3, ( I3.23|) can be improved 
slightly when summing over h in certain ranges; we leave that discussion 
for section [7J 

In generalizing (3.14) and (3.23) to functions on $\Gamma_0(q)\backslash G$ it is natural
to use the spectral decomposition of $L^2_0(\Gamma_0(q)\backslash G)$ into irreducibles
under the action of $G$ by right translation on this space. This is the
path chosen in [3] and [1] and we will follow these treatments closely,
modifying them as needed for our purposes.

The Lie algebra $\mathfrak{g}$ of $G$ consists of the two by two matrices of trace
zero. An element $X$ of $\mathfrak{g}$ gives rise to a left invariant differential
operator on $C^\infty(G)$;

(3.24)   $D_X f(g) = \frac{d}{dt} f(g \exp(tX))\Big|_{t=0}.$

The operators $D_H$, $D_R$, $D_L$ with $H$, $R$, $L = R - V$ in (2.1) generate the
algebra of left invariant differential operators on $C^\infty(G)$. For $k > 0$
define the Sobolev $k$-norms on functions on $\Gamma_0(q)\backslash G$ by

(3-25) ||/lk*:= E WW 

ord(D)<fc 

where $D$ ranges over all monomials in $D_H$, $D_R$ and $D_L$ of degree at
most $k$. For a unitary representation $\pi$ of $G$, $\mathfrak{g}$ acts on the associated
Hilbert space $V_\pi$ and one can define Sobolev norms of smooth vectors
in the same way. The center of the algebra of differential operators is
generated by the Casimir operator $\omega$ which in our basis is given by

(3.26)   $\omega = -\tfrac14\,(D_H D_H + 2 D_R D_L + 2 D_L D_R).$

In Iwasawa coordinates it is given by

(3.27)   $\omega = -y^2 \Big(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\Big) + y\, \frac{\partial^2}{\partial x\, \partial \theta}.$
We decompose $L^2_0(\Gamma_0(q)\backslash G)$ under right translation by $g \in G$ into
irreducible subrepresentations $\pi$ of $G$. This extends the decomposition
in (3.4) to

(3.28)   $L^2_0(\Gamma_0(q)\backslash G) = \int_{\hat G} V_\pi\, d\mu(\pi),$

where $\pi$ ranges over $\hat G$ the unitary dual of $G$ and $\mu$ is a measure on
$\hat G$ (it depends on $q$ of course) which corresponds to this spectral
decomposition. It consists of a cuspidal part on which the spectrum is
discrete (cuspidal means $\int_0^1 f\big(\begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix} g\big)\, dx = 0$, $\forall g$) and a continuous part
corresponding to an integral over unitary Eisenstein series. According
to this decomposition, for $f \in L^2_0(\Gamma_0(q)\backslash G)$

(3.29)   $f = \int_{\hat G} f_\pi\, d\mu(\pi)$

and

(3.30)   $\|f\|^2_{L^2(\Gamma_0(q)\backslash G)} = \int_{\hat G} \|f_\pi\|^2\, d\mu(\pi).$

Since $\pi$ is irreducible and $\omega$ commutes with the $G$-action it follows
that $\omega$ acts on smooth vectors in $V_\pi$ by a scalar which we denote by $\lambda_\pi$.
Weyl's law for the principal and complementary series representations
of $G$ together with the dimensions of the discrete series representations
in $L^2(\Gamma_0(q)\backslash G)$ imply that for $T > 1$,

(3.31)   $\mu\{\pi : |\lambda_\pi| \le T\} \ll \mathrm{Vol}(\Gamma_0(q)\backslash G)\,(1 + T).$

Note that for $a$, $b$ non-negative integers and $f \in V_\pi$ smooth we have
from $\omega f = \lambda_\pi f$ that



(3.32) < (l + A*)- 



Using the Hecke operators, Atkin-Lehner theory and choosing suitable
bases to embed the space of new forms of a given level $t \mid q$ to level $q$
(see [22111]) we can further decompose $L^2_0(\Gamma_0(q)\backslash G)$ into an orthogonal
direct sum/integral of irreducibles on which the Fourier coefficients
satisfy the analogue of (3.7). In more detail if $\pi$ is an irreducible
constituent in the above decomposition and $\pi$ is of level $q$ (by which
we mean that $V_\pi$ has a vector which is a classical modular new form of
level $q$) then the Whittaker functional

(3.33)   $W_f(y) := \int_0^1 f(h(t)a(y))\, e(-t)\, dt$

is non-zero as a function of $f$ a smooth vector in $V_\pi$. Moreover as is
shown in [1]

(3 ' 34) vaS\gr *< w, '- iy '> 

where the second inner product is the standard inner product on L 2 (W, 
and in the Kirillov model for n and satisfies Q 

(3.35)   $(\lambda_\pi q)^{-\epsilon} \ll_\epsilon c_\pi \ll_\epsilon (\lambda_\pi q)^{\epsilon}.$

Moreover the $m$-th coefficient ($m \neq 0$) for our period integral satisfies
the relation; for $f \in V_\pi$

(3.36)   $\int_0^1 f(h(t)a(y))\, e(-mt)\, dt = \frac{\lambda_\pi(m)}{\sqrt{|m|}}\, W_f(|m|y).$

Here $\lambda_\pi(m)$ is the eigenvalue of the $m$-th Hecke operator on $V_\pi$. Recall
that we are assuming that these satisfy

(3.37)   $|\lambda_\pi(m)| \le \tau(|m|)\, |m|^{\theta}$

and similarly that the Laplace eigenvalue $\lambda_\pi = \tfrac14 + t_\pi^2$ (in the case that
$\pi$ is spherical), if $i t_\pi > 0$ then

(3.38)   $i t_\pi \le \theta.$
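The divisor function $\tau$ in (3.37) is itself a model sequence of Hecke eigenvalues (the Eisenstein case, where $\theta = 0$ and the bound holds with equality), and it satisfies the Möbius-inverted multiplicativity relation that is used later as (4.5). The sketch below checks that relation numerically for $\lambda = \tau$ only; it is an illustration under that assumption, not a statement about general $\pi$, and the helper names `tau`, `mobius` and `hecke_rhs` are our own.

```python
from math import gcd


def tau(n):
    """Number of positive divisors of n >= 1."""
    return sum(1 for d in range(1, n + 1) if n % d == 0)


def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:      # p^2 divided the original n
                return 0
            result = -result
        p += 1
    if n > 1:                   # one remaining prime factor
        result = -result
    return result


def hecke_rhs(a, b):
    """sum_{d | (a,b)} mu(d) * tau(a/d) * tau(b/d), the right side of (4.5) for lambda = tau."""
    g = gcd(a, b)
    return sum(mobius(d) * tau(a // d) * tau(b // d)
               for d in range(1, g + 1) if g % d == 0)


# tau(ab) should agree with the Moebius-twisted sum for all a, b.
mismatches = [(a, b) for a in range(1, 40) for b in range(1, 40)
              if tau(a * b) != hecke_rhs(a, b)]
print("mismatches:", mismatches)
```

When $a$ and $b$ are coprime only $d = 1$ contributes and the relation reduces to plain multiplicativity $\tau(ab) = \tau(a)\tau(b)$.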

The invariant differential operators on induce an action on Wf, 
namely 

D x W f = W Dxf 

and using D H = 2y±, D R = 2my and D L = ^-(^ + y£r) we get 
using the various normalizations and (3.34), (3.35), (3.36), (3.37) and
(3.38) (see 0]) that for $b > 0$ fixed, $m \neq 0$ and $f \in V_\pi$



(3.39) / f(h{t)a(y))e(-mt)dt^ b 



X w q 1 T{\m\)y* 



my y/q 1 + \my\ 



w 4 + b - 



As mentioned above, in [4] it is shown how this analysis may be extended
to any $\pi$, not just the ones of level $q$. The same bounds (3.39)
may be established for the Eisenstein spectrum and in that case one
can take $\theta = 0$. Hence if $f \in L^2_0(\Gamma_0(q)\backslash G)$ is smooth we may apply



1 Note that our normalization of the Sobolev norms on $C^\infty(\Gamma_0(q)\backslash G)$ and that of [3]
differ by a factor of $\mathrm{Vol}(\Gamma_0(q)\backslash G)$.
(EEZg) , dS30D and (EM]) to arrive at

f(h(t)a(y))e(—mt)dt = / f w (h(t)a(y))e(—mt)dtdfi(ir) 

J 8 Jo 

\my\ e 1 + \my\ 



G 



(3.40) « e>6 1 JL^(H)r 



my 1 + |my| 

For $m = 0$ only the Eisenstein series enter and a similar analysis of the
constant term yields that for smooth $f$'s in $L^2_0(\Gamma_0(q)\backslash G)$

(3.41) / f(h(t)a(y)) dt « tfy^WfWw*- 

Jo 

(3.40) and (3.41) are the generalizations of (3.14) to $\Gamma_0(q)\backslash G$ and as
we have noted these bounds are essentially contained in [1]. The slight
improvement (3.23) can also be incorporated into this $\Gamma_0(q)\backslash G$ analysis
and this gives our main estimate for the period integral:

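Proposition 3.1. For $\epsilon > 0$, $b > 0$ fixed and $\theta$ admissible for the
Ramanujan/Selberg conjecture for $GL_2/\mathbb{Q}$, if $f \in C^\infty(\Gamma_0(q)\backslash G)$ and
$m \in \mathbb{Z}$, then for $m > 0$

$\int_0^1 f(h(t)a(y))\, e(-mt)\, dt \ll_{\epsilon,b} \frac{q^{\epsilon}\, \tau(m)\, y^{1/2}\, \|f\|_{W^{6+b}}}{(my)^{\epsilon}\, (1 + |my|)^{b}}\, \min\!\Big(y^{-\theta},\, 1 + \frac{y^{-1/4}}{q^{1/2}}\Big)$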

while for $m = 0$, assuming $\int_X f\, d\mu_G = 0$,

f{h(t)a{y))dt<^ e (qy- 1 )^ WfWw- 



V e y e qi 



Remark. For $\Gamma = \Gamma_0(1)$ the inequality (3.38) is satisfied for $\theta = 0$,
and then in the bound for $m > 0$ in Proposition 3.1 we can substitute
$y^{-\theta}$ by $m^{\theta}$ (see [3]). For $\pi$ continuous we can substitute $y^{-\theta}$ by
$\min(1, |t_\pi|)$ (see 0E]).

To end this section we record some bounds for sums over Hecke
eigenvalues $\lambda_\pi(n)$ that will be needed later. We restrict to $\Gamma = \Gamma_0(1)$,
the full modular group, as this is what will be used. For such a $\pi$, which is
either a fixed holomorphic form or a fixed Maass form, it is known that
(for the holomorphic case it is classical while for the Maass case see
PI): for $T > 1$ and $x \in \mathbb{R}$



(3.42) K(m)e(mx) Tr 



m<T 
We need control on the dependence in (3.42) of the implied constant.
This can be done in terms of the eigenvalue $\lambda_\pi$ of $\pi$. It is also convenient
for us here and elsewhere to work with the smooth normalized sums;

(3.43)   $\sum_m \frac{\lambda_\pi(m)}{\sqrt{m}}\, e(mx)\, \psi(mu)$

for $u > 0$ and small.

We can use the set up above with Whittaker functions to bound the
sums in (3.43). For $f \in V_\pi$ one can control the $L^\infty$-norm of $f$ by its
Sobolev norms (see [5]):

(3-44) ll/IU c«A^ 1 )||/|| w a 

for ii discrete, with A^ = max(l, [ A-^- 1 ) , and 

(3-45) sup IMI <<A^ 1 ) \\f\\ w3 

g=h(x)a(y)k(8)eG V 1 + V 

for 7r continuous. 

Let $\pi$ be discrete. Suppose that $W_f$ has support in $(1,2)$ for some
$f \in V_\pi$. From (3.34) and the action of the differential operators on $W_f$
we deduce

(3-46) Il/H^ « 6 A»+°« \\W f \\ w2b , 

with the norms for $W_f$ being the usual Sobolev norms for real functions.
On the other hand, for any $\psi \in C_0^\infty((1,2))$, there exists $f \in V_\pi$ with
$W_f = \psi$. This comes from the fact that if $f \in V_\pi$ then $f_g(x) = f(xg) \in V_\pi$
for any $g \in G$, and $W_{f_{a(y)}}(u) = W_f(yu)$, $W_{f_{h(x)}}(u) = e(x) W_f(u)$.
Then, due to (3.44) and (3.46), for any $0 < u < 1$ we have

(3.47) V ?^e(mx)1>(mu) = f(h(x)a(u)) « \l + °M\\if>\\ w6 . 



In the same way we have @ 
(3.48) 

El^l 2 !^)! 2 ) 2 = (f\fW)a{u))\*dty <<A* +0 «| m |^. 

For tt continuous, the suitable norm for the Hilbert space V n satisfies 
min(l, 1^1) ||/|| v^r x ll'W/IU 3 an d again from the action of the differen- 
tial operators on Wf we have 

(3.49) min(l, \t„\)\\f\\ wb « 6 ~\ b +°W\\W f \\ w *,. 



2 Note that in terms of the dependence in $\lambda_\pi$ this is much weaker than Iwaniec's
°!> J2 m <T \K{m)\ 2 < e T\% and its extensions (lTgj) and (T7TU1) 
Proceeding as in the discrete case, from (3.45) and (3.49) and sending
$h(x)a(u)$ to the fundamental domain gives

(3.50) 

mm(l,\t w \)y^0-e(mx)ilj( m u) < min(— T ^-)A^ «|^|| W 6, 



m q \\qx\ 

in 

where $q$ is the natural number smaller than $u^{-1/2}$ for which the quantity
in the formula is the largest. This last formula can be improved
(by using Poisson Summation in $\mathbb{R}^2$, or by the Voronoi formula) whenever
1/| 1 5a;| | > l/uq, in the sense of adding the decay factor 

/ V||gg!K -6 
l/uq 

for any fixed b > to the right of the inequality — but paying with a 

factor O b (A? (b) ||'0||wo( b ))- 

Finally, one can also improve on (3.48) in the following way (see [36])

(3.51) K(m)| 2 « e A^L 

M<m<M+L 

for any $L > M^{3/5+\epsilon}$, for any $\epsilon > 0$.

4. Effective equidistribution: non-discrete algebraic measures

In this section we prove effective versions of Dani's Theorem. We begin
with some useful notation. Since we are going to talk about probability
measures, it is a good idea to normalize sums.

Definition 4.1 (Expectation). Let $J \subset \mathbb{R}$ be a finite set. Let $f : \mathbb{R} \to \mathbb{C}$.
We define the expectation of $f$ in $J$, and we write it as $\mathbb{E}_{x \in J} f(x)$,
as $\sum_{x \in J} f(x) / \sum_{x \in J} 1$. If $J$ is a bounded subinterval of $\mathbb{R}$, we define
the expectation in the same way but using integrals.
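Definition 4.1 can be made concrete in a few lines. This is a minimal sketch under our own conventions: `expectation` handles a finite set, and `expectation_interval` approximates the interval case by the midpoint rule (the `steps` parameter is an assumption of this sketch, not part of the definition).

```python
def expectation(f, J):
    """E_{x in J} f(x) for a finite collection J: sum of f over J divided by |J|."""
    points = list(J)
    return sum(f(x) for x in points) / len(points)


def expectation_interval(f, a, b, steps=10_000):
    """Approximate (1/(b-a)) * integral_a^b f(t) dt with the midpoint rule."""
    h = (b - a) / steps
    return sum(f(a + (k + 0.5) * h) for k in range(steps)) * h / (b - a)


print(expectation(lambda n: n * n, range(1, 5)))            # average of 1, 4, 9, 16
print(expectation_interval(lambda t: t * t, 0.0, 1.0))      # close to 1/3
```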

To deal with bounded functions that vary slowly we use Lipschitz 
norms. 

Definition 4.2 (Lipschitz norm). Let $f : X \to \mathbb{C}$. We define the
Lipschitz norm of $f$ as

$\|f\|_{\mathrm{Lip}} = \|f\|_{L^\infty} + \sup_{x \neq y \in X} \frac{|f(x) - f(y)|}{d_X(x, y)}.$
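To get a feel for Definition 4.2, the norm can be estimated on a finite sample of points. The sketch below is only an illustration: since it takes maxima over a sample it gives a lower bound for the true Lipschitz norm, and the names `lipschitz_norm`, `points` and `dist` are our own.

```python
from itertools import combinations


def lipschitz_norm(f, points, dist):
    """Sup norm plus largest difference quotient of f over a finite sample of points."""
    sup_norm = max(abs(f(x)) for x in points)
    slope = max(abs(f(x) - f(y)) / dist(x, y)
                for x, y in combinations(points, 2) if dist(x, y) > 0)
    return sup_norm + slope


sample = [0.0, 0.5, 1.0]
value = lipschitz_norm(lambda x: 2 * x, sample, lambda x, y: abs(x - y))
print(value)  # sup norm 2 plus slope 2
```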

Finally, we define the quantitative concept of equidistribution that 
we shall use to describe our main results 
Definition 4.3 (Effective equidistribution). Let $0 < \delta < 1/2$. Let
$g : J \to X$, with $J$ either a subinterval of $\mathbb{Z}$ or of $\mathbb{R}$. We say that
$g(x)$, $x \in J$ is $\delta$-equidistributed w.r.t. a Borel probability measure $\mu$
on $X$ if

$\Big|\mathbb{E}_{x \in J} f(g(x)) - \int_X f\, d\mu\Big| < \delta\, \|f\|_{\mathrm{Lip}}$

for any $f : X \to \mathbb{C}$.
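The inequality in Definition 4.3 can be tested for a single function on a toy example. The sketch below replaces $X$ by the circle $\mathbb{R}/\mathbb{Z}$ with Lebesgue measure and the orbit by an irrational rotation; this substitution is an assumption made purely for illustration, and checking one test function does not establish $\delta$-equidistribution.

```python
import math


def discrepancy(points, f, integral, lip_norm):
    """|E f(points) - integral| / ||f||_Lip for one test function f."""
    avg = sum(f(x) for x in points) / len(points)
    return abs(avg - integral) / lip_norm


alpha = math.sqrt(2)
orbit = [(n * alpha) % 1.0 for n in range(1, 1001)]   # rotation orbit on R/Z


def f(x):
    return math.cos(2 * math.pi * x)


# For this f: sup norm 1, Lipschitz constant 2*pi w.r.t. the circle distance,
# and integral 0 against Lebesgue measure.
lip = 1 + 2 * math.pi
delta = discrepancy(orbit, f, 0.0, lip)
print(delta)
```

The printed value is the smallest $\delta$ for which this particular $f$ satisfies the inequality on the first 1000 points of the orbit.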

Remark. This concept of quantitative equidistribution is similar
to, but weaker than, the one used by Green and Tao in [13].

The Lipschitz norm controls the Sobolev norms in the following
sense: let $\delta > 0$; for any $f \in C(X)$ with $\|f\|_{\mathrm{Lip}} = 1$ there exists
another function $f_\delta \in C(X)$ such that

(4.1)   $\|f - f_\delta\|_{L^\infty} < \delta$

and $\|D f_\delta\|_{L^\infty} \ll_{\mathrm{ord}\, D} \delta^{-\mathrm{ord}\, D}$ for any left-invariant differential operator $D$.
This will allow us from now on to substitute a function with bounded
Lipschitz norm by one with "bounded" Sobolev norms³. We can build
$f_\delta$ as follows: for two functions $f, w \in L^2(SL_2(\mathbb{R}))$ define its convolution
as $f * w(g) = \int_{SL_2(\mathbb{R})} f(gt)\, w(t^{-1})\, d\mu(t)$; if $f$ is $\Gamma$-invariant then so is
$f * w$. Moreover we can also write $f * w(g) = \int_{SL_2(\mathbb{R})} f(t)\, w(t^{-1}g)\, d\mu(t)$
by the left invariance of $d\mu$. Pick $\psi \in C_0^\infty((0,1))$ a non-negative function
and define $\psi_\delta(g) = c_\delta\, \psi(\delta^{-1} d_{SL_2(\mathbb{R})}(g, I))$ with $c_\delta$ the constant that
gives $\int_{SL_2(\mathbb{R})} \psi_\delta(g^{-1})\, d\mu(g) = 1$. Finally, defining $f_\delta = f * \psi_\delta$ one can
check that it satisfies the desired properties.

Before beginning with our quantitative results, let us recall the fol- 
lowing lemma on cancellation in oscillatory integrals. 

Lemma 4.4 (Integration by parts). Let $A > 1$. Let $\eta, F \in C_0^\infty((1,2))$,
$\eta$ with bounded derivatives and $|F'| \asymp A$, $F^{(j)} \ll_j A$ for any $j \in \mathbb{N}$.
Then

J r)(t)e(F(t))dt<z:A~T^. 

Proof. Write $F(t) = A f(t)$ and integrate by parts several times. Since
$|f'| \asymp 1$ and the derivatives of $f$ are bounded, each integration by parts
gains a factor $A^{-1}$, and the result follows. □
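The decay in Lemma 4.4 is easy to observe numerically. The sketch below takes the simplest phase $F(t) = At$ (so $|F'| = A$ on the support) and a standard smooth bump supported in $(1,2)$, and evaluates the integral with the trapezoid rule. The specific bump, the linear phase and the step count are assumptions made for this illustration.

```python
import cmath
import math


def bump(t):
    """A smooth non-negative function supported in (1, 2)."""
    if t <= 1.0 or t >= 2.0:
        return 0.0
    return math.exp(-1.0 / ((t - 1.0) * (2.0 - t)))


def oscillatory_integral(A, steps=20_000):
    """Trapezoid rule for the integral over [1, 2] of bump(t) * e(A t), e(x) = exp(2 pi i x)."""
    h = 1.0 / steps
    total = 0j
    for k in range(1, steps):          # the endpoints contribute 0
        t = 1.0 + k * h
        total += bump(t) * cmath.exp(2j * math.pi * A * t)
    return total * h


for A in (0, 1, 5, 50):
    print(A, abs(oscillatory_integral(A)))
```

The printed absolute values drop off much faster than any fixed power of $A$ would suggest at this scale, in line with repeated integration by parts.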

We are ready to give our first quantitative version of Dani's Theorem 
for a continuous orbit. 



3 Note that in Sections 4, 5 and 6 the entire discussion takes place on $X$; that
is $\Gamma = SL_2(\mathbb{Z})$. It is only in Section 7 that the level $q$ of congruence subgroups is
relevant.
Theorem 4.5. Let $0 < \delta < 1/2$, $T > 1$. For any $\xi \in X$ there exists a
positive integer $j < \delta^{-O(1)}$ such that $\xi h(t)$, $t$ in any subinterval of $[0,T]$ of
length $T/j$, is $\delta$-equidistributed w.r.t. a non-discrete algebraic measure
in $X$. Moreover, it is the volume measure unless $y_T^{-1} < \delta^{-O(1)}$.

Proof. Let us suppose the statement is false. We can assume that <5 _1 is 
not larger than a small power of T — otherwise the result is void — , that 
the Lipschitz norm of / is 1 and that it has 5~°^ 1 - ) -bounded Sobolev 
norms. Because of (I2.5p . there exists y* > yr/o~ 2 and r\ 6 Cq°(IR) with 
IMIw^ < 5^°^ and support outside of |1 ± tTW^ 1 \ < 5 such that 

\E tV (t)f (h(a T + r g^ W))l » 8. 

where fo equals / — f(h(x)a(y*)) dx in the case yr <C 8~°^ and 
f — f x f due otherwise. Notice that this implies, by Proposition 13. II in 
the second case, that fo(h(t)a(y*)) dt <C 5 2 . But then, expanding in 
Fourier series fo(h(x)a(y)) = J2 m /o( m i y)e{mx) we get 

£l/oKy*)|| / V(t)e( 1 2 y t ^-i )\» 6 - 

The derivative of the phase in the exponential is of size |m|y<rT 3> \m\, 
and then integrating by parts (Lemma I4.4j) and using Proposition 13.11 
we have for any large b > that 

oo 

Yl yT 2 r{m)m\my T T)- b > fe 8. 

m=l 

This implies that S^^T' 1 / 2 ^ 5-°^y]l 2 {y T T)- b > 6 5, which is a 
contradiction. □ 

Remark: With the same method we could actually show that the 
integral J Q T f(Tgh(t)) dt equals J x f d(i G + 0{y l J 2+o{l) )\\f\\ w o(v h which 
is comparable to Theorem 1 in [3H] for the particular case V = SL(2, Z). 
This result could also be proven by using quantitative mixing, as done 
in for T\G compact but taking care of the position of the orbit. 

When we consider $\Gamma g$ fixed, we can make the result more explicit.

Theorem 4.6 (Effective Dani, continuous orbit). Let $\xi \in X$ fixed. The
non-discrete algebraic measure appearing in Theorem 4.5 is the volume
measure unless there exists an integer $q < \delta^{-O(1)}$ such that

$\|q\alpha\| < \delta^{-O(1)}\, T^{-1}.$

Proof. This follows from Theorem 4.5 and Lemma 2.7. □
Remark. Here and hereafter whenever we speak of "algebraic measure"
it will mean one of the three algebraic measures appearing in
Dani's Theorem (described in section 2).

As a corollary of this result we obtain Dani's Theorem for the continuous
flow: if $\alpha$ is irrational, for any $0 < \delta < 1/2$ we can find $T = T_\delta$
large enough so that no $q$ exists as in the statement, and then $\Gamma g h(t)$,
$t \in [0, T]$ is $\delta$-equidistributed w.r.t. the volume measure.

Next we begin our study of the discrete orbit $\xi h(s)^n$, $n = 0, 1, \ldots, N$.
Our aim in the rest of this section is to understand, in terms of $\xi$,
when any large piece of this orbit is $\delta$-equidistributed w.r.t. a
non-discrete algebraic measure. We will be able to do it for not very large
$s$; essentially in the range $s < (sN)^{1/5}$.

In the proofs we shall use the following version of the Stationary
Phase principle (a more precise one can be found in [18]):

Lemma 4.7 (Stationary Phase). Let $\eta \in C_0^\infty((1,2))$. Let $A > 1$,
$F \in C^\infty$ with $F'' \asymp A$ and $F^{(j)} \ll_j A$ for $j \ge 3$ in the interval $(1/2, 4)$.

Then, we have

r){t)e{F{t)) dt = V *{t F )e{F{t F ))A- 1 / 2 + O(vTR^) 

where $t_F$ is the only point where $F'$ vanishes, and $\eta^* \in C_0^\infty((1/2, 4))$
depends just on $A$ and $F''$, and $\eta^*$ and its derivatives are bounded (in
terms of $\eta$ only).

Proof. First, let us write 

r){t)e(F(t))dt = e(F(t F )) J rj(t F + s)e(J(t F ,s))ds 
where 

J(t,s) = F(t + s) - F(t) - F'(t)s = / F"(t + v)dvdu. 

Jo Jo 

Let us choose d = A 1 / 8 . For some i[) G Cg°((-2, -1) U (1, 2)) we can 
write 

oo 

i=-t» 1<2J<C 1 

with ipo and ipi coming from the sum over TP < 1 and TP > G\ respec- 
tively. Decomposing the integral accordingly, one can easily show that 
the part corresponding to ipi is bounded by G x 1 ^ o( - 1 ^ < ^-VK 1 )^ and 
the rest can be expressed as 

A~ 1/2 e(F(t F ))r]*(t F ) 
with

V*(t)=Vo(t)+ E M*)> 

K2J<Ci 

where 

^W = ^ 1 ^(M)e(C 2 J c (t, S ))cfe. 

with s) = ^(s)?7(t + CA-Ws) and J c (i, s) = C~ 2 J(t, CA'^s), 
and i]q is defined as r] 2 o but changing ip by The first thing to note 
is that the support of rj* is contained in (1/2,4) and the second is 
that i]* just depends on F", A and rj. It is easy to show that rf and 
its derivatives are bounded. For the rest, we have that the support 
of 4>c(t,-) is contained in (1,2); moreover -j^Jc x 1) and the partial 
derivatives of both ipc and Jc in t and s are uniformly bounded. Then, 
by integrating by parts several times, one shows that (^)^c(^) <Cj C^ 1 
and then (f)V0) <j 1 □ 
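The shape of the main term in Lemma 4.7 can be checked numerically in the simplest case of a quadratic phase. The sketch below assumes $F(t) = \tfrac{A}{2}(t - t_F)^2$ with $t_F = 3/2$, for which the classical Fresnel integral gives the leading term $\eta(t_F)\, e(F(t_F) + 1/8)\, A^{-1/2}$; so in this special case the amplitude $\eta^*(t_F)$ of the lemma is approximately $\eta(t_F)\, e(1/8)$. The bump function, the phase and the tolerance are choices made for this illustration only.

```python
import cmath
import math


def bump(t):
    """A smooth non-negative function supported in (1, 2)."""
    if t <= 1.0 or t >= 2.0:
        return 0.0
    return math.exp(-1.0 / ((t - 1.0) * (2.0 - t)))


def stationary_phase_check(A, steps=40_000):
    """Return (numerical integral, leading stationary-phase term) for F(t) = A (t - 1.5)^2 / 2."""
    t_F = 1.5                                   # the only zero of F'
    h = 1.0 / steps
    total = 0j
    for k in range(1, steps):                   # trapezoid rule, endpoints contribute 0
        t = 1.0 + k * h
        phase = 0.5 * A * (t - t_F) ** 2        # F(t), with F'' = A
        total += bump(t) * cmath.exp(2j * math.pi * phase)
    integral = total * h
    # Fresnel: integral of e(A s^2 / 2) ds over the real line equals e(1/8) / sqrt(A); here F(t_F) = 0
    main_term = bump(t_F) * cmath.exp(2j * math.pi * 0.125) / math.sqrt(A)
    return integral, main_term


integral, main_term = stationary_phase_check(400.0)
print(abs(integral - main_term) / abs(main_term))
```

The printed relative error shrinks as $A$ grows, reflecting that the remainder is of lower order than $A^{-1/2}$.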

In order to prove a quantitative version of Dani's Theorem for the
discrete orbit $\Gamma g h(s)^n$, $n = 0, 1, \ldots, N$, we split the analysis in three
cases: when the piece of orbit is near to a closed horocycle (9t and $y_T^{-1}$
small, $T = sN$), when it is far from any closed horocycle (0t and $y_T^{-1}$
large), and the intermediate case.

We begin with the "near" case. Here, one can parametrize the orbit
essentially as $\Gamma h(p(n)) a(y_T)$ with $p(n)$ a quadratic polynomial. The
following allows us to handle the distribution for such a sequence.

Proposition 4.8 (Near to a closed horocycle). Let < 5 < 1/2. The 

sequence s n = a + f3n + um 2 + iy, n < N is 6 -equidistributed with respect 
to the algebraic measure on the closed horocycle of period y~ l unless 
either mm(y 2 /Nu>, Ny) < 5~°^ or there exists a natural number q 
smaller than (5/r(q 2 ))-°^(l + y~ 1/2 ) such that 

(4.2) HHI + N\\q/3\\ + N 2 qu < yh(5 /r(q 2 ))-°^\ 

where q 2 is the denominator in the expression as reduced fraction of 
W/q- 

Proof. Let us suppose that s n ,n < N does not satisfy the condition 
in the statement and that both y 2 /Nu and Ny are larger than 5~ c for 
c > 1 a large constant. Then 

\E n < N f(a + /3n + ton 2 + iy)rj(n/N)\ > 5, 

for some / e C(X) with \\f\\ wj 5~ j and r) e C o °°((0, 1)) with 
\\v U) \\l™ <j Take M = y^S'^; there exists q 2 < M and a 2 
coprime to $q_2$ such that

P ° 2 + < ^ 

By splitting into arithmetic progressions modulo q 2 we have 

(4.3) 5-°V\E b < q2 E J < N/q J(a + ^ + eq 2 3 + uqj3 2 + iy)r ] (-^)\ > 1. 

Since <j with F{t) = f(a + a 2 b/q 2 + eq 2 t + uoq 2 2 t 2 + 

w)v(jnq^)i we can see (for instance applying Poisson Summation and
integrating by parts) that it is possible to substitute the inner sum by
the integral

(4.4) 5~ |E 6 < 32 f f(a + — + eNt + uN 2 t 2 + iy)rj(t) dt\ > 1. 

J <?2 

At this point, let us remark that from the beginning we could assume 
the support of r\ to be at distance 5 from the point —e(2uN)^ 1 . By the 
spectral decomposition (13.291) and the Fourier series expansion (taking 
into account that the zero coefficient vanishes) we have 

r o(i)| y ^^ e (ma)W f (my)E b<q2 e(—)I(rn)\ > 1 
m ^o VH ' <?2 

for some n with < and / 6 V n with \\f\\ W i <j <5~° (i) and 

J( m ) = / n(t)e(mN(et + uNt 2 )) dt. Then 

5 -Oii)\Y^^^-e{kq 2 a)W f {kq 2 y)I{kq 2 )dt\ > 1. 

fc^O Vl?2«| 

By the multiplicativity of the Hecke eigenvalues

(4.5)   $\lambda_\pi(ab) = \sum_{d \mid (a,b)} \mu(d)\, \lambda_\pi(a/d)\, \lambda_\pi(b/d)$

we have

s-o(i) \M^MA \ y ^M e{mdq2a ) W j mdq2 y) I{mdq2 ) dt \ > J_ 

V^2 f^ n Vrn - r(q 2 ) 



for some d \ q 2 . By our previous remark on the support of rj we have 

1 



6 -o(i) \M^Bi \ Y ^0-e(mdq 2 a)W f (mdq 2 yU(mdq : 



2 U > 



with u = max(|eiV|, uN 2 ) and ip a smooth function with rapidly de- 
caying derivatives 

^\x) < j>fc 6~ j (l + \x\)- k k,j > 0. 
Thus, if $\pi$ is discrete, splitting into dyadic intervals and using (3.47),
Proposition 3.1 (with its remark) and (3.37) we have

S-° {1) (q2/d) e (dq 2 y 1 / 2 min(l, yu^) 1 ' 2 > rfe)" 1 

so that q 2 < 5^°^ and u < y5~°^\ which implies the conditions in 
(14.21) . If 7r is continuous, again by Proposition 13.11 and (I3.50P — with 
the remarks after them — we have 

5-° {l \dq 2 y l l 2 ^y min( ) ! ) > r(g 2 )" 2 

q*dq 2 {y + u) \\q*dq 2 a\\ 

for some q* <C (dq 2 (y + w)) -1 / 2 . Therefore for some d* \ q 2 

q*d*q 2 < (5/r(q 2 ))-°^y-^l + uy' 1 )- 1 

and $\|q^* d^* q_2 \alpha\| < (\delta/\tau(q_2))^{-O(1)}\, y^{1/2}$. These conditions imply (4.2). □
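The rational approximation step at the start of the proof above (choosing $q_2 \le M$ and $a_2$ coprime to $q_2$) is an instance of Dirichlet's approximation theorem: for every real $x$ and integer $M \ge 1$ there is $q \le M$ with $\|qx\| < 1/M$. The following brute-force sketch illustrates that theorem; the function name and the example value $x = \sqrt{2}$ are our own choices and are unrelated to the specific parameters of the proof.

```python
import math
from fractions import Fraction


def dirichlet_approximation(x, M):
    """Return (a/q reduced, |q*x - a|) where 1 <= q <= M minimises the distance of q*x to an integer."""
    best = None
    for q in range(1, M + 1):
        a = round(q * x)
        err = abs(q * x - a)
        if best is None or err < best[0]:
            best = (err, a, q)
    err, a, q = best
    return Fraction(a, q), err


frac, err = dirichlet_approximation(math.sqrt(2), 100)
print(frac, err)   # the error is below 1/100, as Dirichlet's theorem guarantees
```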

We now deal with the intermediate case. In the proof we simply use
bounds for the Fourier coefficients.

Proposition 4.9 (Intermediate case). Let $0 < \delta < 1/2$, $N > 1$ and
$g \in G$. The sequence $\Gamma g h(s)^n$, $n \le N$ is $\delta$-equidistributed w.r.t. the
probability measure carried by $\Gamma g h(t)$, $t \in [0, T]$ unless some of the
quantities $T^{1/5} s^{-1}$, $|W_T|^{-1} s^{-2} T^2 y_T^{-1+2\theta-\epsilon}$ and $y_T T s^{-1} + y_T^{2/5} |W_T| T^{-1}$ is
smaller than $\delta^{-O(1)}$.

Proof. Let us assume for simplicity that Wt > 0. If Tgh(s) n ,n < N is 
not 5-equidistributed w.r.t. the measure carried by Ygh{t),t G [0, T], 
splitting the orbit (12. 5ft we have that 

\E neI f(h(x sn )a(y*))ri(n/N)\ > 5, 

for some y* < y T 5-°^\ f G C(X) with f(t + iyjdt = and 
ll/lkip — 1; V £ Co° with derivatives bounded by 5~°^ and r)(-/N) 
supported in an interval I of length satisfying iVi/l -1 < 5~°^ such 
that 

x t = a + yW(l ±t/WY\ 

with $W = W_T$, $y = y_T$, and $\delta^{-O(1)} |1 \pm sn/W| > 1$. Now, using
the Fourier expansion of $f(t + iy_*)$, the spectral expansion of $f$ and
Proposition 3.1 we get, for some $c$ in $[y^{1/2}, \delta^{-O(1)}]$, that

r°« £ K(m)||E ne/ 7K^)e(ma;, n )l » {cy' 1 ) 1 ' 2 

mE J(cj/ _1 ) 

with $J(u) = [u, 2u]$ for some $\pi$ with $\lambda_\pi \ll \delta^{-O(1)}$. Now, considering
that $x_{sn}$ is invariant by the transformation that sends $(y^{-1}, s, W)$ to
$(cy^{-1}, cs, cW)$, we can write

r°« \K(m)\\E neI r ] (^)e(mx sn )\>(y- 1 ) 1 / 2 . 

m£ J(y~ 1 ) 

with the same conditions for $y$ and $W$ as those in the statement. By
Poisson summation in the inner sum, integration by parts and Stationary
Phase we get

<T°« \K(m)\>T{Wyy 1 ' 2 s~ 1 

me J' 

where J' is an interval contained in Jiy -1 ) of size 0(1 + T/Wy). So, if 
T/Wy < 1, by the bound \\ n (m)\ < m 9+ °^\ we get 5-0(1)^-0+0(1) > 
Til/Wyf^s- 1 and then S'^Wy 1 - 26 ^ > T 2 s~ 2 . If 1 < T/Wy < 
y~ 3 ^ 5 , by Cauchy's inequality and (I3.5ip we deduce that $~°^y~ 3 ^-^ 

is larger than T 2 (Wys 2 )~ l so that S~°^s > T^yTo > T^~t& = T%. In 
the case T/Wy > y~ 3 ^ 5 , from Stationary Phase, (13.511) and Cauchy's 
inequality we get 

E \J2 a M^^)e(2W( m yj/ S y/ 2 )\ 2 > T 2 W~i 

with aj = e(-W(j/s) 1/2 ). Then, either 5-° {1) sT/yW > T^" 1 or 
I>( ^^)Hrny)emrny) l/2 )\ > Ts^ 2 

m 

for some ip G C£°, a* x 1 and < e < 1. 

The first possibility implies that > Ty. The second possibil- 

ity implies, writing r)(x) = J fj(9)e(x9) d9, that 

r 0(1) | J2^( m y) e ( md ) e ( eT ( m y) 1/2 )\ > T S - 2 e- l/2 

m 

for some 6 1, which applying Poisson summation and Stationary 
Phase gives 

r O(l) [(eT) l/2 +r l (eT) -l/ 2] > Ts -2 t -l/2 

so that 5-°^s > T 1 / 4 . □ 

Now we treat the case in which the piece of horocycle is far from a
closed one. The previous result worked for $y_T > s/T$, that is almost for
the whole range. To handle the remaining range, we are going to split
the piece of horocycle of length $T$ in pieces of length $\epsilon T$, $\epsilon < 1$. Then
we shall put those pieces in the fundamental domain Dx, e T, and then
use the Fourier expansion (as in Proposition 4.9). The advantage is
that now we can get an extra cancellation from the fact that different
pieces are essentially uncorrelated, and we can control this independence
arithmetically. The ideas of the method come from the theory
of exponential sums; it is essentially a modification of the one in [23].

Proposition 4.10 (Far from a closed horocycle). Let < 5 < 1/2, 
N > 1. The sequence Tgh(s) n , n < N is 5-equidistributed w.r.t the 
volume measure on X unless some of the quantities T 1 / 5 ^ -1 , y^sT' 1 
and \Wt\~ 1 Ti/ t 2 ^ j is smaller than 5~°^ — with T = sN. 

Proof. Let us assume for simplicity that $W_T > 0$. It is enough to
prove the result changing the three quantities in the statement by the
following ones;

(4.6) T l ' h s-\ (sT)- 2 ^y-\ {T / sf^W'^ 1 , 

y = yT, W = Wt, which are more convenient for our argument. If the 
Tgh(s) n ,n < N is not 5-equidistributed w.r.t. the volume measure on 
X, then there exists / G Ll(X) with ||/|| L i P < 1, \\f\\ws <j and 
an interval / of length satisfying X|/| _1 < S~°^' such that 

\^neif(H x sn)a{y*))\ > 8 

where y* < S~°^y, 5~°^\1 ± sn/W\ > 1 for any n 6 I and 

x t = a + yW{l ±t/W)-\ 

We can take $\alpha$ such that $|\alpha + yW| < 1/2$. Now we divide the sum into
sums such that the argument in $h(\cdot)$ is near to a Farey fraction $a/q$ up
to $q \le K$. Precisely, we are going to take the interval around $a/q$

$\Big[\frac{a}{q} - \frac{1}{q(q + q')},\; \frac{a}{q} + \frac{1}{q(q + q'')}\Big],$

where $q'$ and $q''$ are the denominators of the Farey fractions (up to $K$)
to the left and right of $a/q$ respectively. The point is that both $q + q'$
and $q + q''$ are comparable to $K$. In this way, we can write
a/q 

where a/q ranges over coprime integers a, q with q < K, and f) a / q 
are functions with ||^„/ g ||z°° 5~°^ with uniformly bounded 
support. In this way, we have 

| ^E neJ f(h(x sn )a(y*))rj a/q ( X ™ ^_ g )| > 5 

a/q 

where q < K, a moves in some subinterval of a yTq of density 5 C 
for some c > 0, and n moves in the interval where (x sn — a/q)qK is in 
the support of fj a / q . From here, splitting the support of fj a / q and taking
into account that the fractions with q < 5 2 K are negligible we deduce 
that 

\^^ neI f{h{x sn )a{y*))ri a / q { Xsn l " g )| > 5 

a/q /q 

where \q — q \ < 5 C K for q > 5 2 K and c > large enough, r] a / q 
satisfying the same conditions as fj a / q but with support in an interval 
—independent of a/q — of length 8° for c > a large enough constant. 
Moreover, we can assume that this support is at distance at least S c 
from zero, for some d > 0. We can rewrite it as 

| y2E neI f(h(- + -^)a{y*))r}a/ q {v S n)\ > $ 

a/q 

with v t = q 2 (x t — a/q). But multiplying by 7 a / g G V as in the proof of 
Lemma [2791 we have that j a / q h(a/ 'q + v j V) a (z/*) equals 

h y 2 i 1 2 \2 > a y 2^(2 \2 > k (~ arccot ~^—) 

q v M + [q y*) v + {TV*) TV* 
which, unless (K 2 y)~ l < 8~°^ l \ is at distance 0(5 2 ) from 

K-- - - + r )a(K 2 y ) 
q v 

for some y < S~°^y and r , both independent of a/q. Then we have 
r°( 1 )|E a/g E n /(/ i (-- - — + r )a(K 2 y )) Va/q (v sn )\ > 1 

q Van 

and n moving in an interval of size 5°^ 1 \syK 2 )~ l that contains the n's 
with v sn in the support of r) a / q . The Fourier expansion of / gives that 

f{m,K 2 y )e{mr )E a , q e{-— )E n e{-^-)r] a ^{v sn )\ 

is at least c°^\ with f(h(t)a(y)) = ^7J m f(m, y)e(mt), J(u) = [u, 2u] for 
some c with 5°^ l \K 2 y) 1 / 2 < c < 5^°^\ Thus, by Cauchy's inequality, 
Proposition 13.11 and (13.481) we have 

r° (1) E meJ( ^- 2c?/ - 1) |E aig e(-— )E n e(-— )r/ a , 9 (^ n )| 2 > (K^cy" 1 )- 1 . 

Now, since yT = ysN and v sn are invariant by the change (s,?/ -1 ,^) 
to (as, q/ -1 , cW), and the conditions (14. 6ft only improve, then we can 
write 

r° (1) E meJ(K - 2y -i ) |E a , g e(- — )E n e(- — )r7 aj(? (^ n )| 2 > (iTVT 1 - 

9 ^sn 
By Poisson Summation in the n-sum and Stationary Phase we get that

c- n/iu ,^ / am_ , . Wy „, m .. , ,myK 2 .. 9 

6- ° (1) E me j {K -^) |Ea,,e(- T )E, 6 f e (-^0(_)),£ /ff (_^_) | 2 

is at least s~ 2 , with s > 0(v) = -2(wj'/s) 1 / 2 + v/W, and 6 a/w - 

independent of $m$, $|b_{a/q,j}| = 1$; $j$ moves in a dyadic interval of size
$\delta^{-O(1)} s$. Expansion of the square followed by Poisson Summation in $m$
and Stationary Phase gives that either

(i) the proportion of (x,x') with x = (a,q,j) such that fl and A 
are both at most 5~°^K 2 y is larger than S°^s~ 2 

(ii) or the proportion of (x, x') such that Q x A x U times 
5~°( 1 \K 2 yU~ 1 ) 1 / 2 is larger than 5°^s~ 2 for some U of size 
at least 5~°^K 2 y, 

where 

Si = maxdl^ - \\^^\) A = |(£)V» - 1 - fWz^£|. 

g g (WyKy j a — a/q 

The maximum appearing in f2 comes from the fact that the second 
term is never comparable to the first one. The proportion of terms in 
the diagonal x = x' is D = 8~"°^ 1 \K 2 ysT)~" 1 . On the other hand, the 
proportion of terms with Vt x A x 1 times 5~° ( - 1 \K 2 y) 1 / 2 is certainly 
smaller than E = 5~°( 1 \K 2 y) 1 / 2 . Now, choosing K 2 y = (sT) -2 / 3 , we 
get D = E = 5 - °( 1 )(sT) -1 ' 3 , which is certainly smaller than S°^s~ 2 
unless T 1//5 s _1 is bounded by 5'°^. If we show, assuming that any 
of the quantities in ( 14. 6 p is larger than 5~ 0( -^ , that the contribution of 
the rest of terms (x, x') to either (i) or (ii) is smaller than either D or 
E, we are done. 

First, let us consider the terms \\a/q — Q>'/q'\\ x (7 < 1 for some 
U ^> 1/K 2 . These are in case (ii), and their proportion is 0(U), so 
they contribute less than U{K 2 yU^ 1 ) 1 ^ 2 <C E. The rest of terms satisfy 
a/q = d jq 1 so q' = q and a' = a + Xq for some A <C yT, and then 

Q = \\\(WyKy 2 , A = |(j7j) 1/2 - 1 - A(a - a/q)^. 

If / = j, then A ^ 0, A ^ O(fi) and A x \\\(Wy)- 1 > (Wy)~\ so 
that by gl]) we have A > (s/T) 13 / 20 > r°«(sT)- 2 / 3 > 5-°^K 2 y. 
Therefore, we are neither in (i) nor in (ii). If j' ^ j, we have to look 
at cases (i) and (ii). In any case, it is going to be enough to fix q and 
the residue of a modulo q, so that 



AH07i) 1/2 -i-A(s-/i)- 
with a x yW and fi an integer moving in the range fi <C yT. Moreover,
let us fix the range A x eyT for some e < 1. Then Q x eTW~ 2 (K 2 y)~ 1 
and A = j'/j - 1 - 0(eTW- 1 ) so x eTW~ 2 (sT) 2 / 3 < eTW~ l . 

Consider first the case (ii). On one hand, the proportion of $(j,j')$
involved is $\delta^{-O(1)}\epsilon TW^{-1}$, independent of $\lambda$ and $\mu$. On the other hand,
for fixed $j, j', \lambda$, we can see that there is at most one $\mu$ such that $\Delta \asymp \Omega$.
Therefore, the contribution of these terms is bounded by

5 -0(D K -2( eTW -l^ yT yl( K 2 yn -iy/2 < ^/^ K 2 yT) -l/2 E 

which is smaller than $E$. Finally we examine case (i). Since $\Delta \ll
K^2y = (sT)^{-2/3} \ll s^{-1}$, we again get the same proportion for $(j,j')$.
This time, for fixed $j, j', \lambda$, we can see that either there is just one $\mu$ such
that $\Delta < \delta^{-O(1)}K^2y$ or their proportion is $O(K^2yW^2T^{-2}\epsilon^{-1})$. The first
possibility gives a contribution like the one before with $K^2y\Omega^{-1} = 1$,
and then we already know it is negligible. The second gives a
contribution of

S-oWR^ieTW-^iK^W'T^e- 1 ) = S^Wy^ 1 
which by gSJ) is 0((T/ s) 13 / 20 ^ 1 ) and then is bounded by D. 

□ 

Remarks. (a) It would be possible to improve the range for $s$ a
little by treating non-trivially the exponential sums appearing in the
proof in the last application of Stationary Phase. This would improve
the range also in Theorem 4.11.

(b) We could prove this result also (with a smaller range for $s$)
by proceeding as in the proof of Theorem 4.9 but treating the sums
$\sum_n \lambda_\pi(n)e(F(n))$ with van der Corput's Lemma and shifted
convolution. One could also prove it in a quicker way (again with $s$ smaller)
by relating the discrete orbit to the continuous one as in Lemma 3.1 of
[39], and then using Theorem 4.5.

Now we join the previous cases to prove an effective version of Dani's 
result for sums 

Theorem 4.11 (Effective Dani, discrete orbit). Let $\xi \in X$ be fixed. Let
$0 < \delta < 1/2$, $N > 1$ and $s > 0$. There exists a positive integer
$j < \delta^{-O(1)}$ such that the sequence $\xi h(s)^n$, with $n$ in any subinterval of
$n < N$ of length $N/j$, is $\delta$-equidistributed w.r.t. a non-discrete algebraic
measure on $X$ unless either $(sN)^{1/5}s^{-1} < \delta^{-O(1)}$ or the conditions in
Lemma 2.9 are satisfied. Moreover both $j$ and the measure are the ones
in Theorem 4.6 (the continuous case).

Proof. Let $\xi = \Gamma g$. For $s < \delta^{-O(1)}N^{-1}$ the result is trivial; otherwise,
by Propositions 4.10 and 4.9 and by Theorem 4.6 we get the result as
long as W T < <T° (1) S - 2 T 2 y 2 T e ' 1+o{1) , with T = sN . If we are outside 
of that range, we have that 

\W T \ >8-°W{N5+Ns 2 y;J f ). 

Then, we have 

Ygh(s) n = h(x + y T sn + y T —) a (y T ) + 0(S 2 ) 

Wt 

and we can apply Proposition 14. S\ which gives that l;h(s) n ,n < N is 
5-equidistributed w.r.t. the algebraic measure on the closed horocycle 
of period y^, 1 unless there exists an integer q < (5 / T(q 2 ))~°^ y^^ 2 such 
that 

\\qx\\ + iV|| 9 sy r || + (sN) 2 qy T /\W T \ < y\{5 /r(g 2 ))-°« . 

Actually we must have q > (S/T(q 2 )) ^y T 1 ^ 2 because £ is fixed. But 
these are the conditions in Lemma 2.9. □

Remark. The proof actually shows that $\xi h(s)^n$, $n < N$ is
$\delta$-equidistributed w.r.t. the measure carried by $\xi h(t)$, $t \in [0, sN]$ unless
either $(sN)^{1/5}s^{-1} < \delta^{-O(1)}$ or the conditions in Lemma 2.9 are
satisfied.

As a corollary of this result we obtain Dani's Theorem for discrete 
orbits. 

Corollary 4.12 (Dani Theorem). Let $\xi \in X$ with $\alpha$ irrational. Then
$\xi h(s)^n$, $n < N$ becomes equidistributed w.r.t. the volume measure on $X$
as $N$ goes to infinity.



5. Effective equidistribution: discrete algebraic measures

In the previous section we gave necessary conditions on $\xi$ for the
probability measure carried by any large piece of the orbit $\xi h(s)^n$,
$n < N$ to be near to some non-discrete algebraic measure. This is all we
need in order to prove our results for primes. However, it remains
to answer the following questions: is it possible for that measure to be
always near to an algebraic measure, either discrete or non-discrete?
Are the conditions on $\xi$ really sharp for the measure to be near to a
non-discrete algebraic measure? In this section we will show that the answer
to both questions is essentially "yes", as long as $(sN)^{1/5}s^{-1} > \delta^{-O(1)}$.

Theorem 4.11 says that, for $(sN)^{1/5}s^{-1} > \delta^{-O(1)}$, if the sequence
$\xi h(s)^n$, $n < N$ is not equidistributed w.r.t. a non-discrete algebraic
measure, then it is near to a sequence

A + Bn . e! + e 2 f + e 3 (f) 2 + ie 4 



(5.1) s n = he 4 



q q 2 



for some integers $A, B, q$ with $(A,B,q) = 1$ and $\epsilon_j \ll (\tau(q_2)\delta^{-1})^{O(1)}$,
with $B/q = a_2/q_2$, $(a_2, q_2) = 1$. We also have $\epsilon_4/q_2 > (1+s)^{-2-o(1)}$ from the
remarks after Lemma 2.9. A key ingredient to understand $s_n$ is going
to be to understand its restriction to an interval $L < n < L + q$; there
it is very near to a sequence

, x A + Bn W/ Mi M 2s 
5.2 + M 2 (— -+i^) with n mod g 2 , 

where $M_1, M_2 < \tau(q_2)^{O(1)}$. We can describe the points

$$\frac{A + Bn}{q}, \qquad n \bmod q_2,$$

mod 1 in a more convenient way. First, we can write

$$\frac{A + q_1a_2n}{q}, \qquad n \bmod q_2,$$

with $q_1 = q/q_2$ and then $(A, q_1) = 1$. Since $a_2$ is coprime to $q_2$ we can
write

$$\frac{A + q_1n}{q_1q_2}, \qquad n \bmod q_2.$$

Next, we can write $q_2 = q_2'q_2''$ with $q_2'$ coprime to $q_1$ and $q_1$ a multiple
of every prime dividing $q_2''$. Since $(q_1, q_2') = 1$, the equation

$$A + q_1x \equiv 0 \bmod q_2'$$

has a solution in $x$, and then we can write

$$\frac{q_2'h + q_1n}{q_1q_2} \qquad \text{with } n \bmod q_2,$$

with $(h, q_1) = 1$ and then $(h, q_1q_2'') = 1$. Finally, writing $n = vq_2' + uq_2''$
with $u, v$ integers the points in (5.2) can be described as

u v h te .Mi M 2 . , . , .. 

"T + — + T, + M 2{^- + ^) u mod q 2 , v mod g 2 . 

q 2 q 2 qW 2 r r 
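
The successive rewritings of the points $(A+Bn)/q$ above can be sanity-checked numerically. The sketch below is only an illustration: the function names (`split_q2`, `point_sets`) and the sample values are hypothetical, and each point $c/q$ mod 1 is represented by its numerator $c$ mod $q$. It checks that the four descriptions give the same set of points when $(A, q_1) = 1$ and $(a_2, q_2) = 1$.

```python
from math import gcd


def split_q2(q1: int, q2: int) -> tuple[int, int]:
    """Factor q2 = q2p * q2pp with gcd(q2p, q1) == 1 and every prime of q2pp dividing q1."""
    q2p, q2pp = q2, 1
    while (g := gcd(q2p, q1)) > 1:
        q2p //= g
        q2pp *= g
    return q2p, q2pp


def point_sets(A: int, q1: int, q2: int, a2: int):
    """Numerators (mod q = q1*q2) of the points in each of the four descriptions."""
    q = q1 * q2
    q2p, q2pp = split_q2(q1, q2)
    # (A + B n)/q with B = q1 * a2
    original = {(A + q1 * a2 * n) % q for n in range(q2)}
    # a2 is invertible mod q2, so a2*n can be replaced by n
    simplified = {(A + q1 * n) % q for n in range(q2)}
    # solve A + q1*x = 0 (mod q2p) and shift n by the solution
    x0 = (-A * pow(q1, -1, q2p)) % q2p
    h = (A + q1 * x0) // q2p
    shifted = {(q2p * h + q1 * n) % q for n in range(q2)}
    # n = v*q2p + u*q2pp gives u/q2p + v/q2pp + h/(q1*q2pp)
    split = {(u * q1 * q2pp + v * q1 * q2p + h * q2p) % q
             for u in range(q2p) for v in range(q2pp)}
    return original, simplified, shifted, split, h


if __name__ == "__main__":
    sets = point_sets(A=7, q1=6, q2=20, a2=3)
    print(sets[0] == sets[1] == sets[2] == sets[3])
```

The last step uses that $q_2'$ and $q_2''$ are coprime, so $n = vq_2' + uq_2''$ runs over all residues mod $q_2$.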

To further study the behaviour of these points in $X$ it is natural to split
them into classes $C_l$ for any $l$ a divisor of $q_2'$, each $C_l$ corresponding to
the points such that $(u, q_2') = l$.

For any point $g$ of $SL(2, \mathbb{R})$ of the form

,b , , Mi + iM 2 . 
g=(-+M 2 2 2 ,0 
g g 2 
with $(b, q) = 1$, we can multiply to the left by $\gamma_{b/q} \in SL(2, \mathbb{Z})$, with

$$\gamma_{b/q} = \begin{pmatrix} -\bar{b} & * \\ q & -b \end{pmatrix},$$

obtaining

(5.3) , = 76/?5 = (-- - + -2arccot — ). 

So in our setting it is easy to see that g ^ g* sends the points in C\ to 

-r2- 



l g 2 + Mf + Mf ^Mf + Mf' M 2 h 

where x is the inverse modulo gig 2 '- One can rewrite these points as 

( u 1 + ^1/^2 -2arccoA 

l g 2 + ?l(?2 ' Mt + My Ml + Mr M 2 h 

with ft, an inverse of h modulo q±. But then this is the same as 

(1 + ^ - Ml,M2 + 1 -2arccot ^) 
\ 2 qiq'i Ml + Mf M x 2 + Mf ' 2arCC °M 2 j 

with $n \bmod q_2'$, $(n, q_2') = 1$. In general, for any $l \mid q_2'$, everything works
the same way but dividing $q_2'$, $M_1$ and $M_2$ by $l$, and then the points of
$C_l$ can be seen as

(5.4) (_ + 1 2,,_ 2arccot _ ) , 

with n mod (n,q' 2 /l) = 1 and 

g£ 2 /i Mi/Ma i 

^ = 77 - , r o . 7T^ + 



q iq ' 2 ' Ml + Mf Mf + Mf 

We are finally prepared to state the main result concerning the case
near a closed horocycle.

Proposition 5.1. Let $0 < \delta < 1/2$. There exists $j < \delta^{-O(1)}$ such that
when $n$ is restricted to any subinterval of $n < N$ of length $N/j$ the
sequence $s_n$ is $\delta$-equidistributed w.r.t. some algebraic measure in $X$.

Proof. If $|\epsilon_2| + |\epsilon_3| < \delta^{-O(1)}\epsilon_4$ then $s_n$ and $s_m$ are at distance
$O(\delta^{-O(1)}|n-m|/N)$, hence the result follows, with each algebraic measure
supported on a point. Thus, from now on we assume $|\epsilon_2| + |\epsilon_3| \gg \delta^{-O(1)}\epsilon_4$.

Let us treat first the case $q_2 \ll \delta^{-O(1)}$; if $n$ is in an interval $J$ of
length $\delta^cN$ containing the point $N'$ (we should choose $N'$ such that
$t = N'/N$ makes $|\epsilon_1 + \epsilon_2t + \epsilon_3t^2| \gg |\epsilon_1| + |\epsilon_2| + |\epsilon_3|$), we select the
sequence

A + Bn e 1 + e 2 f + e 3 {f) 2 + te A 

r n = h e 4 — n mod pq 2 

pq [pi) 

for any prime $p > \delta^{-O(1)}$, which carries an algebraic measure $\mu$. Let us
see that $s_n$ in the interval $J$ is $\delta$-equidistributed w.r.t. $\mu$. For that, let
us split $J$ into arithmetic progressions $n \equiv n_0 \bmod q_2$; for each one we
have (due to (5.3))

Ys n = T(x + — - -2 arccot — ) 

e 4 & 2 + e { b 2 n + ej e 4 

for b n = + e 2 jj: + ^3 (77) 2 an d x o an d qo \ q depending on no- Then 



for most ra's in the arithmetic progression contained in J. Since 
I 1 1 < 5~ Ws 2+0 W < iV ^) by classical methods in exponential sums 
one can prove (see for instance 



e 4 1( ?0 

IE n gJ jn = no mod q2 e (^ ?2 ) ^ ^ 
n 

for large $c'$ and any $k < \delta^{-O(1)}$. Then, we have that $s_n$, $n \in J$,
$n \equiv n_0 \bmod q_2$ is $\delta$-equidistributed w.r.t. the measure carried by

t + i-§- te[0,l]. 

On the other hand we have, again by (5.3), that $r_n$ with $n \in J$,
$n \equiv n_0 \bmod q_2$ is $\delta$-equidistributed w.r.t. the measure carried by

( x ' o + ^ _ ^ % + i % -2 arccot ^) m mod p 

for some x' depending on tiq. Since any two consecutive points of 
this sequence are at distance p~ l 5~°^ < 5 2 , we have that r n n G J, 
n = n mod is 5-equidistributed w.r. t. the measure carried by 

t + i$- te[o,l), 

so $s_n$, $n \in J$ is $\delta$-equidistributed w.r.t. $\mu$.
It remains the case $|\epsilon_2| + |\epsilon_3| > \delta^{-O(1)}\epsilon_4$ and $q_2 > \delta^{-O(1)}$. In this case
we can take the algebraic measure $\mu$ carried by

A + Bn t e x + e 2 f + e 3 (f ) 2 + ie 4 

r n = h e 4 ^— n mod q 2 , 

q q z 

to approximate s n , n G J. To see that s n , n G J is 5-equidistributed 
w.r.t. /x, we shall show the same restricting n to any subinterval of J 
of length q 2 ; since g < o" -0 ^?/ - ^ and y^ 1 <C sN we have that q < SN; 
then s n restricted to a subinterval of J of length g 2 satisfies 

s „ = ^ + £4 fi±i^±|g)l±^ + ow „ modfe 

g g 2 

with $N'' \in J$. Now we restrict $n$ further to the set $C_l$, for some $l \mid q_2'$,
of $n$'s satisfying $(n, q_2') = l$; it is necessary to look just at the $l$'s with
$l < \tau(q_2')\delta^{-O(1)}$, because the others give a negligible contribution. Due
to (5.4) the restriction of $r_n$ to $C_l$ is

(5.5) ( — - + / X\ -pj o+i>To ^,-2arccot ). 

q 2 /l e 4 b 2 N ,+e 2 b 2 N , + e 2 ' e 4 

with n mod g 2 /Z, (n,q' 2 /l) = 1, and Xi = ^77. Now, it is easy to show 
that for any x, e > and j, d G N we have 

x<n<x+ej'(i,(n,d)=l n<jd,(n,d)=l 

Applying it for d = q' 2 /l, j = qijq 2 and e = 5l 2 /{b 2 N , + e 2 ), since 
ej(j)(d) S> 5~° ( - 1 - ) r((i), we have that the sequence in ( 15.51) is 5-equidistributed 
w.r.t. the measure carried by 

I 2 b 

( t + i U 2,-2arccot— ) te[0,l]. 

b N i + e 4 64 

One can proceed in the same way for the restriction of s n , obtaining 
that it is 5-equidistributed w.r.t. the measure carried by 

I 2 b]\j" . 

(t + irrp g' - 2arccot ) £G|0,1|. 

b N „ + e 4 e 4 

But, since both $N'$ and $N''$ are in $J$ we have that both measures are
similar, and then $s_n$, $n \in J$ is $\delta$-equidistributed w.r.t. $\mu$. □

As a corollary of Theorem 4.11 and Proposition 5.1 we obtain

Theorem 5.2 (Effective equidistribution). Let $\xi \in X$ be fixed. Let
$0 < \delta < 1/2$, $N > 1$ and $s > 0$. There exists a positive integer $j < \delta^{-O(1)}$
such that the sequence $\xi h(s)^n$, with $n$ in any subinterval of $n < N$
of length $N/j$, is $\delta$-equidistributed w.r.t. an algebraic measure on $X$,
unless $(sN)^{1/5}s^{-1} < \delta^{-O(1)}$.

Remarks. (a) As shown in the proofs of this section, in the
quantitative setting the discrete algebraic measures are much more
complex than non-discrete algebraic ones. Therefore, to be near to a
non-discrete algebraic measure is a condition that is much stronger than to
be near to a general algebraic measure.

(b) This result gives control over pieces of orbits of the discrete
horocycle flow for $s$ not very large. It is possible to show that this control
fails for $s > N^{3+\epsilon}$; in fact, taking $x = 0$, $y = q^{-2}$, $s = Aq^{-2}$ and
$W^{-1} = A^{-2}q^{-3}$ for $A, q$ natural numbers, $q < N^{\epsilon}$, $A > N^{3+\epsilon}$ we have
that $gh(s)^n$ is very near to the periodic sequence

2 

n i 

1 — - n mod q. 

q q l 

One can show that for certain $q$'s the measure carried by this sequence
is not near to any algebraic measure. The same happens for Theorem
4.11. Perhaps both results remain true for s < by substituting
algebraic measures by "polynomial algebraic measures" (and changing
the conditions in the statement of Theorem 4.11), meaning any
measure carried by a periodic sequence $\Gamma h(p(n))a(y)$ with $p$ a polynomial
of degree $O(1)$.

6. Large closure of prime orbits 

In this section we are going to deduce Theorem 1.1 from an upper
bound for the sum

$$\sum_{p<T} f(\xi h(s)^p).$$

In order to get such a bound we make use of sieve theory, in particular 
the following special case of a Selberg's result (see [2TJ Theorem 6.4]) 

Lemma 6.1 (Upper bound Sieve). Let $(a_n)_{n<T}$ be a sequence of
non-negative numbers. For any $d \in \mathbb{N}$ write

\a + r d . 



E 



• d 

n<T 
n=0 mod d 



Then, for any 1 < D < T we have 



J2 q p^ , A rn + J2 T3 ^\ rd ^ 

Jr^ 1 log \JD f~Z, 

VT<p<T to d<D 



with $\tau_3(d) = \sum_{d_1d_2d_3=d} 1$.
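
For concreteness, the weight $\tau_3(d)$ in the lemma counts ordered factorizations of $d$ into three positive integers. The following brute-force sketch (the function name `tau3` is just an illustrative choice) computes it directly from that definition.

```python
def tau3(d: int) -> int:
    """Number of ordered triples (d1, d2, d3) of positive integers with d1*d2*d3 == d."""
    count = 0
    for d1 in range(1, d + 1):
        if d % d1:
            continue
        rest = d // d1
        # every divisor d2 of rest determines d3 = rest // d2
        count += sum(1 for d2 in range(1, rest + 1) if rest % d2 == 0)
    return count


if __name__ == "__main__":
    print([tau3(d) for d in range(1, 13)])
```

Since $\tau_3$ is multiplicative with $\tau_3(p^k) = \binom{k+2}{2}$, it is small on average, which is why it is harmless in the remainder term of the sieve.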
So the task now is reduced to have tight control over the sums 

m<T/d 

for most of the $d$'s in a range $1 < d < D$ with $D$ as big as possible.
This can be handled by Theorem 4.11 whenever $D < T^{1/5}$. The
precise result that follows is

Theorem 6.2. Let $\xi \in X$ with $\alpha$ irrational, $f \ge 0$ and $s > 0$. Then

(6.1) $\mathbb{E}_{p<T} f(\xi h(s)^p) \le 10 \int_X f\,d\mu_G + o_T(1)\|f\|_{\mathrm{Lip}}$.

Proof. Let $\|f\|_{\mathrm{Lip}} = 1$. Let us begin by dealing with the case in which
the conditions in Lemma 2.9 (i) are satisfied for $\xi$, $s$, $N = T$ and
$\delta = (\log T)^{-A}$, $A$ a large constant. Take $q < \delta^{-O(1)}$ the smallest integer
satisfying the conditions in the lemma. It is easy to check that $q$ has
to go to infinity with $T$; moreover by Lemma 2.9 (ii) and the remarks
after the lemma one sees that £,h(s) n is at distance S from a ^-periodic 
sequence, with q 2 \ q[q^} 2 q 3 < 5^°^, when n is restricted to any 
subinterval J such that \ J\ = 5 C T for some constant c. In this situation 
we can apply the Siegel-Walfisz theorem (9} page 133] for primes in 
arithmetic progressions to deduce that 

n<T,(n,q 2 )=l 

/(^( S )") + 0((logT)- i ). 

Finally, the condition in Lemma 2.9 (i) cannot be satisfied changing $s$
by $sd$, $N$ by $N/d$ if we put $\delta^* = q^{-\epsilon}$ and $d < q^{\epsilon}$ for some small $\epsilon > 0$;
then we can apply Theorem 4.11 to deduce that

(6.2) $\mathbb{E}_{n<T,\ n \equiv 0 \bmod d} f(\xi h(s)^n) = \int f\,d\mu + O(q^{-\epsilon})$,

for any $d < q^{\epsilon}$, where $d\mu$ is the average of the algebraic measures
appearing in the statement and then it is independent of $d$. Therefore,
using the identity $1_{m=1} = \sum_{d \mid m} \mu(d)$ and (6.2) for $d < q^{\epsilon}$ we have

$$\mathbb{E}_{n<T,\ (n,q_2)=1} f(\xi h(s)^n) = \mathbb{E}_{n<T} f(\xi h(s)^n) + O(\tau(q_2)q^{-\epsilon})$$

and then the result follows from Corollary 4.12.
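
The Möbius identity $1_{m=1} = \sum_{d \mid m} \mu(d)$ invoked above is what converts the coprimality condition into sums over multiples of $d$. A small self-contained check follows; it is only a sketch, with `mobius` implemented by trial division and not taken from any library.

```python
def mobius(n: int) -> int:
    """Moebius function mu(n) computed by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:      # p^2 divided the original n
                return 0
            result = -result
        p += 1
    if n > 1:                   # one remaining prime factor
        result = -result
    return result


def mobius_divisor_sum(m: int) -> int:
    """Sum of mu(d) over all divisors d of m."""
    return sum(mobius(d) for d in range(1, m + 1) if m % d == 0)


if __name__ == "__main__":
    print([mobius_divisor_sum(m) for m in range(1, 11)])
```

Applied with $m = (n, q_2)$, the identity expresses the indicator of $(n, q_2) = 1$ as $\sum_{d \mid (n, q_2)} \mu(d)$, so only the averages over $n \equiv 0 \bmod d$ with $d \mid q_2$ are needed.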

Now let us suppose that the conditions in Lemma 2.9 are not
satisfied. Then, applying Theorem 4.11 we have

$$\mathbb{E}_{n<T} f(\xi h(s)^n) = \int f\,d\mu + O((\log T)^{-A}),$$

with $\mu$ the average of the algebraic measures there. Let us suppose
that for any $D < D_0 = T^{1/5}$ and for any $d$ in $D < d < 2D$ but at
most $O(\delta D)$ exceptions, the conditions in Lemma 2.9 are not satisfied
changing $s$ by $sd$ and $N$ by $N/d$. For any of the non-exceptional $d$ we
have (again by Theorem 4.11)



$$\mathbb{E}_{n<T,\ n \equiv 0 \bmod d} f(\xi h(s)^n) = \int f\,d\mu + O((\log T)^{-A}).$$

Therefore, applying Lemma 6.1 we get

$$\mathbb{E}_{p<T} f(\xi h(s)^p) \le 10\,\mathbb{E}_{n<T} f(\xi h(s)^n) + O((\log T)^{O(1)-A})$$

and we are done by Corollary 4.12. So, we have finished unless



\\dq 



R/i 



Dt° {1) 5-°Ut-\ 



dq a 



Rd 



for more than 5D d's in D < d < 2D for some D, where Rd = R{^d9o)i 

a d = a(ld9o), Tfifo = C, Id < r° (1) 5~°W, and r d = r((g d ) 2 ). Let us see 
that this cannot be true for any D < T x ~ e , e > 0. 

Let us assume it is true. We know that < L = D°^\ Then we 
can split [1,L] into at most O(loglogL) intervals of the shape [t, 2t 2 ], 
and there will be at least <5L>(loglog L)^ 1 ^> 5 2 D efs with in one 
of them; let us consider now just those d's; for any of them we have 
T * < r d < T 1 for some fixed r* < L. 

This implies that 7d = 7 and qd = q for a set .4 of more than D/M 
d% with M = rf m 6-°W. So for them 



DMT~ 



ft 



DM(sT 2 



and then effective equidistribution in the torus (see Lemma 3.2 in [13] ) 
implies that 

s 



(6.3) 



IN 



< MT~ 



for some h < M. We can assume h is coprime to [hqs/R]. Now, since 
M~ 2 < DMT' 1 it is easy to check that h \ d for any d G A. But then 
d = Xh, we have [c?gs/i?] = X[hqs/R] and 

||A[fcgs/#HI < DM(sT 2 y l 

for more than D/M A's with A < D, which again by effective equidis- 
tribution in the torus gives 

\\j[hqs/R]a\\ < M(sT 2 )- 1 

for some j < M. Now j [hqs/R] = [j hqs/R] by (16. 3p . so choosing 
9* = jq < M we have 



(6.4) 



i? 1 



MT" 



R. 



ft 



M(sT 



2\-l 
Now, it is easy to check that for any $d \in \mathcal{A}$ we have

which implies that (qd) 2 = ^ <[<?*-§] ^ <?*• Since r* < r(fc) < r 2 we 
arrive at 

fc « q * < t° «r°« < T(fc) «r°« 

so < (J -0 ^ and then r* < (J -0 ' 1 ). But then (j6.4p means that the 
conditions in Lemma 12.91 are satisfied for the original sequence, which 
is a contradiction. □ 

Remark: Theorem 4.11 was not strictly necessary to prove this result
(and then Theorem 1.1); one could proceed in a more direct way, taking
advantage of the extra average in $d$ in Lemma 6.1. Anyway, it seems
difficult to get a much better level in Lemma 6.1 that way. We did not
do it that way because we think Theorem 4.11 is interesting by itself.

Finally, let us see that Theorem 1.1 follows from Theorem 6.2. Let
$x \in X$ generic, and $\nu_x$ an accumulation point for the sequence $(\pi_{x,N})_N$
in $C^*(X^*)$, where $X^*$ is the one-point compactification of $X$. Take
$f \in C^*(X^*)$, $f \ge 0$. By approximation, we can assume that $f$ has finite
Lipschitz norm in $X$, so that Theorem 6.2 gives

$$\int_X f\,d\nu_x \le 10 \int_X f\,d\mu_G,$$

and we are done.

7. Density of the Hecke orbit 

In this section we are going to prove a stronger result (Theorem
1.3) for the special orbit $H_Nh(p)$, $p < N$, from which we can deduce
in particular that it becomes dense in $X$ when $N \to \infty$. This will be
possible because we can get a good level of distribution for linear sums,
and above all because we can handle bilinear sums up to a considerable
level. We input those bounds into the following special case of a sieve
result from [TP]

Lemma 7.1 (Asymptotic sieve). Let $\{a_n\}_{n \in \mathbb{N}}$ be a sequence of
non-negative numbers such that $a_n \ll \tau(n)$ and $a_n = A + c_n$ for some
constant $A > 0$ and a sequence $c_n$ satisfying the "Type I condition of
level $\alpha$"

(7.1) $\mathbb{E}_{D<d<2D} |\mathbb{E}_{n<x/d}\, c_{dn}| \ll (\log x)^{-3}$

for any $D < x^{\alpha-\epsilon}$ and also the "Type II condition of level $\gamma$"

(7.2) $\mathbb{E}_{D<d_1<d_2<2D} |\mathbb{E}_{n<\min(x/d_1,x/d_2)}\, c_{d_1n}c_{d_2n}| \ll (\log x)^{-22}$,

for any $D$ with $x^{(\log\log x)^{-3}} < D < x^{\gamma-\epsilon}$, for any fixed $\epsilon > 0$. Then we
have

(7.3) $|\mathbb{E}_{p<x}\, a_p - A| \le c(\alpha,\gamma)A + O(\epsilon)$,

with $c(\alpha,\gamma)$ an explicit decreasing function, such that $c(1/2, 1/3) = 0$,
$c(1/2, 1/5) < 4/5$ and $c(1/2, \gamma) = 1$ for some $\gamma \in (1/6, 1/5)$.

Moreover, in the summations we can assume that $d$ is square free
and $d_1, d_2$ are primes.

Now, we are going to check the Type I condition. 

Proposition 7.2 (Bound for Type I sums). Let $f$ with $\int_X f = 0$. We
have that

$$\mathbb{E}_{D<d<2D} |\mathbb{E}_{n<N/d} f(H_Nh(dn))| \ll (\log N)^{-3}\|f\|_{\mathrm{Lip}}$$

for any $D < N^{1/2-\theta-\epsilon}$, for any $\epsilon > 0$.

Proof. Let us assume $\|f\|_{\mathrm{Lip}} = 1$, and suppose the result is false. Then,
we have

Dn 

E D<d<2D \E n f(TH N h(dn))r](—)\^5 

for 5 = (logiV)~ A , A a large constant, with 77 e C£°(— 2,2) with 
II 7 ? 5 II/IIwj • We have the Iwasawa parametrization 

$$H_Nh(dn) = h(dnN^{-1})a(N^{-1}),$$

and then by the Fourier expansion $f(h(x)a(y)) = \sum_m \hat{f}(m, y)e(mx)$
and Poisson Summation we have

Ez.^^l/KiV- 1 )!^^^})! »5 

m 

so that from Proposition 13.11 we have 

r o ( i )jD - liV -i/ 2+ * 2 j2 r(jm^{jj})\»i 

j<^8~ 1 DN 

which taking into account the decay of fj gives a contradiction. □ 

Let us go with the Type II condition 

Proposition 7.3 (Bound for Type II sums). Let $f_1, f_2$ be continuous
functions in $\Gamma\backslash G$ with $\|f_1\|_{\mathrm{Lip}} = \|f_2\|_{\mathrm{Lip}} = 1$ and $\int_X f_1 = 0$. Let
$N^{(\log\log N)^{-3}} < D < N^{(1-2\theta)/(5+2\theta)-\epsilon}$ and $D < d_1 < d_2 < 2D$, with $d_1, d_2$
primes. Then we have

$$\mathbb{E}_{n<\min(N/d_1,N/d_2)} f_1(H_Nh(d_1n))f_2(H_Nh(d_2n)) \ll (\log N)^{-22}.$$
Proof. As in (4.1) we can replace the Lipschitz norm by Sobolev norms 
and hence if the result is false, we have 

Dn 

\^n<N/Dh{H N h{d x n))f 2 {H N h{d 2 n))r){—)\ > 5 

for 5 = (log N)' A 1 A a large constant, with 77 £ Co°(0, 1) with H^^Hl^ <c 

and ||.D/i||z,°° <^ordD S~~ ordD . By the Fourier expansion of /», 
fi(h(x)a(y)) = fi(m, y)e(mx), the bounds in Proposition EHJ Pois- 
son summation in n and integration by parts we have that either 

I £ / 1 (m 1 ,iV- 1 )/ 2 (m 2 ,iV- 1 )|»5 

or 

(7.4) I ^ / 1 (m 1 ,iV- 1 )/ 2 (m 2 ,iV- 1 )|»5 J D- 2 

for some k ^ 0, k < S'^DN. If the first possibility is true, we get 

i 

But from Proposition 13.11 we have ^(O^N^ 1 ) <C 5~°^N~^, and from 
the spectral expansion (13.291) of f\, the multiplicativity of Hecke eigen- 
values (14. 5 p and Parseval (I3.48j) we have 

(7.5) ^fxid^N-^U^N- 1 ) « {d 1 d 2 ) e ^, 

which gives a contradiction. 

Let us now assume that (17.41) is true. This is a shifted convolution 
sum, and we can proceed as in (3j Hj. We can translate it as 

1 1 1 

f 1 (h(d 1 x)a(—))f 2 (h(d 2 x)a(—))e(-kx)dx\ > 5D 2 , 



and further as 



f*{h(x)a{—-))e{-kx)dx\^>8D 2 , 



with /* = f d Jd 2 and 

f d (g) = f(a(d)ga(D/d)). 

fd is a T Q (d)— invariant function and since D/d x 1 we have ||.D/d||z,°° <C 
ll-D/IU 00 ! thus, /* can be seen as a function in T Q (did 2 )\G with Sobolev 
norms ||/*||iyj (did 2 )i +e 5~' J . Therefore, from Proposition ^. ll we get 
the bound 

f,(k, (DN)- 1 ) < 5-°^(DN)-^ +e (d l d 2 )^ 
which gives a contradiction in our range for D. □ 

Now, assuming that $\theta = 0$, due to Propositions 7.2 and 7.3 we can
apply Lemma 7.1 with $\alpha = 1/2$ and $\gamma = 1/5$ for $a_n = f(\Gamma H_Nh(n))$,
and then

Theorem 7.4. Assuming $\theta = 0$, for any non-negative $f$ we have

$$\Big|\mathbb{E}_{p<N} f(H_Nh(p)) - \int_X f\,d\mu_G\Big| \le \frac{4}{5}\int_X f\,d\mu_G + o(1)\|f\|_{\mathrm{Lip}}.$$

From this result we can deduce Theorem 1.3 as we did in the previous
section with Theorem 1.1.

We end by discussing some improvements on the levels of distribution
in Propositions 7.2 and 7.3. For the type I sums we will establish a
level of 1/2, matching what Proposition 7.2 gives with $\theta = 0$. For the
type II sums, Proposition 7.3 with $\theta = 7/64$ yields a level of $25/167 =
0.1497\ldots$. We establish a level of $3/19 = 0.1578\ldots$, which is a small
improvement but still falls short of the magic number $c$ ($1/6 < c < 1/5$)
which would render Theorem 7.4 unconditional.
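
The numerical levels quoted here follow from the exponent $(1-2\theta)/(5+2\theta)$ in the range of $D$ in Proposition 7.3. The sketch below (the function name `type_two_level` is an illustrative choice) reproduces them with exact rational arithmetic and compares them with the interval $(1/6, 1/5)$ from Lemma 7.1.

```python
from fractions import Fraction


def type_two_level(theta: Fraction) -> Fraction:
    """Exponent (1 - 2*theta) / (5 + 2*theta) bounding D in Proposition 7.3."""
    return (1 - 2 * theta) / (5 + 2 * theta)


if __name__ == "__main__":
    known = type_two_level(Fraction(7, 64))      # level with theta = 7/64
    conjectural = type_two_level(Fraction(0))    # level with theta = 0
    improved = Fraction(3, 19)                   # level obtained at the end of the section
    print(known, float(known))
    print(conjectural, float(conjectural))
    print(improved, float(improved))
    print(known < improved < Fraction(1, 6) < conjectural)
```

In particular the improved level $3/19$ lies strictly between $25/167$ and $1/6$, so it does not yet reach the range of $\gamma$ for which the sieve constant drops below 1.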

In what follows we assume that $f$ is orthogonal to the Eisenstein
series, because for them we have $\theta = 0$; moreover, for simplicity let us
consider $f$ $K$-invariant. The sums in Proposition 7.2 are (assuming we
have a smooth sum, as we can)

We can also assume that $f$ is orthogonal to the space of Eisenstein
series (because for them we know that $\theta = 0$), and then we can write
the Fourier expansion

f(h(x)a(±)) = M jjWkx) = 0(6) + M ^)< k *) 

k k 

where the sum in k is restricted to \k\ < 5~°^N and (k,N) < d^ ^. 
This is done by using the spectral expansion, the multiplicativity of 
A^(fc) and bounds for sums J2k ^Tr(k)ip(k). Thus it is enough to bound 

dxD k n 

Applying Poisson Summation in n yields 

(7-6) IM N-i)\\rj(^^)\ « i £>IM a?)'' 

db<D k,v k 
with $C_k$ the number of $d, t \ll D$ such that $dk \equiv t \bmod N$. For $D <
N^{1/2-\epsilon}$, which we assume is in force, one can check that for each $k$ as above
$C_k \ll N^{\epsilon}$ as follows: if $t = 0$ there are no solutions for the congruence
since $(k,N) < \delta^{-O(1)}$; otherwise, for any pair of solutions $(d_1,t_1)$ and
$(d_2,t_2)$ we have that $d_1t_2 \equiv d_2t_1 \bmod N$, so that $d_1t_2 = d_2t_1$. This
means that any solution is of the form $(d,t) = \lambda(d^*,t^*)$, for some
$(d^*,t^*)$ fixed; moreover, $\lambda$ must divide $N$ and we are done. On the
other hand, we have $\sum_k C_k \ll_{\epsilon} D^2N^{\epsilon}$, so that $\sum_k C_k^m \ll_{\epsilon,m} D^2N^{\epsilon}$.
From (3.15) and noting that $q = 1$ we have

|/(*. AT 1 ) < e * + M k )\ 

[t,-[<«-°w 

and hence 
(7.7) 

/V r ~2 +e * /V~5 +e * 

It is known (see [21] and [26]) that for x > 1, 

(7.8) £ |A,(A;)| 4 « e A;,r, 

[fc[<as 

hence 

(7.9) J* « 5 + r°«^^JV2Df = 5 + r°Wiv-2+^i 

This gives a level of distribution of 1/2 for these type I sums. 
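
The counting argument for $C_k$ used above rests on the fact that, once $D$ is small compared with $N^{1/2}$, any two solutions of $dk \equiv t \bmod N$ are proportional. The brute-force sketch below illustrates this on small numbers. It makes assumptions not fixed by the text: $t$ is allowed to be negative, the ranges are $1 \le d \le D$ and $0 < |t| \le D$, and the function names are hypothetical.

```python
from itertools import combinations


def congruence_solutions(k: int, N: int, D: int) -> list[tuple[int, int]]:
    """All (d, t) with 1 <= d <= D, 0 < |t| <= D and d*k = t (mod N)."""
    sols = []
    for d in range(1, D + 1):
        r = (d * k) % N              # representative in [0, N)
        for t in (r, r - N):         # the only candidates with |t| < N
            if 0 < abs(t) <= D:
                sols.append((d, t))
    return sols


def all_proportional(sols: list[tuple[int, int]]) -> bool:
    """True when d1*t2 == d2*t1 for every pair of solutions."""
    return all(d1 * t2 == d2 * t1
               for (d1, t1), (d2, t2) in combinations(sols, 2))


if __name__ == "__main__":
    N, D = 101, 5                    # 2*D*D < N, so congruent cross products are equal
    print(congruence_solutions(50, N, D))
    print(all(all_proportional(congruence_solutions(k, N, D)) for k in range(1, N)))
```

With $2D^2 < N$ the difference $d_1t_2 - d_2t_1$ is a multiple of $N$ of absolute value less than $N$, hence zero, which is exactly the step from the congruence to the equality in the text.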

For the type II sums $y^{-1} = ND$, $q = D^2$ and we could try to use
the improvement in Proposition 3.1 in the range $\sqrt{q}\,y^{-\theta} > y^{-1/4}$, i.e.
$D(ND)^{\theta} > (ND)^{1/4}$ or $D > N^{(1-4\theta)/(3+4\theta)}$. For $\theta = 7/64$ this is $D > N^{9/55}$.
However $9/55 > 25/167$, so that the improvement in this range does not
give an improvement of the level 25/167. Instead we again exploit the
average over $k$ and again we use [22]; for $x > 1$

(7.10) |A,(m)| 8 « e (A,g) e x. 

\m\<x 
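
The threshold $N^{9/55}$ quoted in the preceding paragraph can be recovered by a short computation. The sketch below assumes that the range condition reads $\sqrt{q}\,y^{-\theta} > y^{-1/4}$, which is the reading consistent with the stated exponent $9/55$ at $\theta = 7/64$.

```latex
\begin{align*}
\sqrt{q}\,y^{-\theta} > y^{-1/4},\quad q = D^2,\ y^{-1} = ND
 &\iff D\,(ND)^{\theta} > (ND)^{1/4} \\
 &\iff D^{3/4+\theta} > N^{1/4-\theta} \\
 &\iff D > N^{(1-4\theta)/(3+4\theta)},\\
\theta = \tfrac{7}{64}:\quad \frac{1-4\theta}{3+4\theta}
 &= \frac{9/16}{55/16} = \frac{9}{55} \approx 0.1636
  > \frac{25}{167} \approx 0.1497 .
\end{align*}
```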
For $d_1, d_2 \asymp D$

(7.11) II = ^J2v(^)fi(H N h(d 1 n))f 2 (H N h(d 2 n)) 

n 

£. V ( — \f (— 1 \ 

~ N 2^^ N '^N' ND> 

n 

<L»- e + E i/^t + z/iv^iVD)- 1 )!. 

|i|>l<-D 1+ ° (£) 

From fl37T5|) with |fc| < iVT> 1+e and for k = due to f[7~5j) we have that 
\l(k,(ND)- 1 )\<^D-+ i -^^- Mk)\\(f*,h)\- 

Hence 
(7.12) 

E (atz?)- 1 )! < E i(/*.^>i E i A i(*+ 

[i|,|i/[<£>l+« V? |<|,M<£> 1+e 

The inner sum may be estimated by Holder, 

E \W + uN)\<( E |A,(m)| 8 )l(L> 2 + 2e )l 

|t|,M<D 1+6 |m|<AfD 1 + E 

which by ( 17.10p is 

(7.13) <C \j(ND)*D*D°( e \ 
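
The Hölder step above has the following general shape. This is only a sketch for generic complex numbers $a_m$ indexed by a finite set $S$; the size bounds in the second line are assumptions matching the ranges $|S| \le D^{2+2\epsilon}$ and $|m| < ND^{1+\epsilon}$ used in the text, together with an eighth-moment bound of size $x N^{O(\epsilon)}$ for $x = ND^{1+\epsilon}$.

```latex
\begin{align*}
\sum_{m \in S} |a_m|
  &\le \Big(\sum_{m \in S} |a_m|^{8}\Big)^{1/8} |S|^{7/8}
  && \text{(H\"older with exponents $8$ and $8/7$)},\\
|S| \le D^{2+2\epsilon},\quad
\sum_{m \in S} |a_m|^{8} \ll ND^{1+\epsilon} N^{O(\epsilon)}
  &\implies \sum_{m \in S} |a_m| \ll (ND)^{1/8} D^{7/4} N^{O(\epsilon)} .
\end{align*}
```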

Substituting this into (I7.12p gives (recall that q = D 2 ) 
(7.14) 



i i 

2 



ii^d-*+ {nd) ~~ 2+ \ nd) 1 sd 7 h E K/*.^>l 2 )'9 

< ZT e + (ND)-^{ND)^D 1 i(D 2 )^N 0{e) = D~ e + N~*d¥ N 0( - e) . 
This gives a level of distribution of 3/19 for the type II sums. 